7. Special Relativity 



Although Newtonian mechanics gives an excellent description of Nature, it is not
universally valid. When we reach extreme conditions (the very small, the very heavy or
the very fast) the Newtonian Universe that we're used to needs replacing. You could
say that Newtonian mechanics encapsulates our common sense view of the world. One
of the major themes of twentieth century physics is that when you look away from our
everyday world, common sense is not much use.

One such extreme is when particles travel very fast. The theory that replaces
Newtonian mechanics is due to Einstein. It is called special relativity. The effects of
special relativity become apparent only when the speeds of particles become comparable
to the speed of light in the vacuum. The speed of light is

$$c = 299792458 \ \mathrm{m\,s^{-1}}$$

This value of c is exact. It may seem strange that the speed of light is an integer
when measured in meters per second. The reason is simply that this is taken to be
the definition of what we mean by a meter: it is the distance travelled by light in
1/299792458 seconds. For the purposes of this course, we'll be quite happy with the
approximation $c \approx 3 \times 10^{8} \ \mathrm{m\,s^{-1}}$.

The first thing to say is that the speed of light is fast. Really fast. The speed of
sound is around $300 \ \mathrm{m\,s^{-1}}$; escape velocity from the Earth is around
$10^{4} \ \mathrm{m\,s^{-1}}$; the orbital speed of our solar system in the Milky Way
galaxy is around $10^{5} \ \mathrm{m\,s^{-1}}$. As we shall soon see, nothing travels
faster than c.

The theory of special relativity rests on two experimental facts. (We will look at the 
evidence for these shortly). In fact, we have already met the first of these: it is simply 
the Galilean principle of relativity described in Section 1. The second postulate is more 
surprising: 

• Postulate 1: The principle of relativity: the laws of physics are the same in all 
inertial frames 

• Postulate 2: The speed of light in vacuum is the same in all inertial frames 

On the face of it, the second postulate looks nonsensical. How can the speed of light
look the same in all inertial frames? If light travels towards me at speed c and I run
away from the light at speed v, surely I measure the speed of light as $c - v$. Right?
Well, no.

This common sense view is encapsulated in the Galilean transformations that we
met in Section 1.2.1. Mathematically, we derive this "obvious" result as follows: two
inertial frames, S and S', which move relative to each other with velocity
$\mathbf{v} = (v, 0, 0)$, have Cartesian coordinates related by

$$x' = x - vt \ , \quad y' = y \ , \quad z' = z \ , \quad t' = t \qquad (7.1)$$

If a ray of light travels in the x direction in frame S with speed c, then it traces out
the trajectory $x/t = c$. The transformations above then tell us that in frame S' the
trajectory of the light ray is $x'/t' = c - v$. This is the result we claimed above: the
speed of light should clearly be $c - v$. If this is wrong (and it is) something must be
wrong with the Galilean transformations (7.1). But what?

Our immediate goal is to find a transformation law that obeys both postulates above.
As we will see, the only way to achieve this goal is to allow for a radical departure in
our understanding of time. In particular, we will be forced to abandon the assumption
of absolute time, enshrined in the equation $t' = t$ above. We will see that time ticks at
different rates for observers sitting in different inertial frames.
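The conflict between the Galilean transformations (7.1) and the second postulate can be
made concrete with a few lines of Python. This is only a minimal sketch: the function name
`galilean_boost` and the choice $v = c/2$ are assumptions made for illustration, not
anything defined in these notes.

```python
C = 299_792_458.0  # speed of light in m/s (exact, by the definition of the metre)


def galilean_boost(x, t, v):
    """Galilean transformation (7.1) along x: returns (x', t')."""
    return x - v * t, t


v = 0.5 * C   # hypothetical relative speed of S' with respect to S
t = 2.0       # an arbitrary positive time in S, in seconds
x = C * t     # position of a light ray x = ct in S

x_prime, t_prime = galilean_boost(x, t, v)
speed_in_s_prime = x_prime / t_prime

# Galilean relativity predicts c - v rather than c in the moving frame.
print(speed_in_s_prime / C)
```

The printed ratio is the Galilean prediction for the speed of the ray in S' in units of c;
it differs from 1, which is exactly what the second postulate forbids.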

7.1 Lorentz Transformations 

We stick with the idea of two inertial frames, S and S', moving with relative speed v.
For simplicity, we'll start by ignoring the directions y and z which are perpendicular to
the direction of motion. Both inertial frames come with Cartesian coordinates: $(x, t)$
for S and $(x', t')$ for S'. We want to know how these are related. The most general
possible relationship takes the form

$$x' = f(x, t) \ , \quad t' = g(x, t)$$

for some functions f and g. However, there are a couple of facts that we can use to
immediately restrict the form of these functions. The first is that the law of inertia
holds; left alone in an inertial frame, a particle will travel at constant velocity. Drawn
in the $(x, t)$ plane, the trajectory of such a particle is a straight line. Since both S and
S' are inertial frames, the map $(x, t) \mapsto (x', t')$ must map straight lines to straight
lines; such maps are, by definition, linear. The functions f and g must therefore be of the
form

$$x' = \alpha_1 x + \alpha_2 t \ , \quad t' = \alpha_3 x + \alpha_4 t$$

where $\alpha_i$, $i = 1, 2, 3, 4$ can each be a function of v.

Secondly, we use the fact that S' is travelling at speed v relative to S. This means that
an observer sitting at the origin, $x' = 0$, of S' moves along the trajectory $x = vt$ in S
shown in the figure. Or, in other words, the points $x = vt$ must map to $x' = 0$. (There
is actually one further assumption implicit in this statement: that the origin $x' = 0$
coincides with $x = 0$ when $t = 0$). Together with the requirement that the transformation
is linear, this restricts the coefficients $\alpha_1$ and $\alpha_2$ above to be of the form,

$$x' = \gamma (x - vt) \qquad (7.2)$$

for some coefficient $\gamma$. Once again, the overall coefficient $\gamma$ can be a function
of the velocity: $\gamma = \gamma_v$. (We've used subscript notation $\gamma_v$ rather than
the more standard $\gamma(v)$ to denote that $\gamma$ depends on v. This avoids confusion
with the factors of $(x - vt)$ which aren't arguments of $\gamma$ but will frequently appear
after $\gamma$ like in the equation (7.2)).

Figure 43:

There is actually a small, but important, restriction on the form of $\gamma_v$: it must be
an even function, so that $\gamma_v = \gamma_{-v}$. There are a couple of ways to see this.
The first is by using rotational invariance, which states that $\gamma$ cannot depend on the
direction of the relative velocity $\mathbf{v}$, but only on the magnitude
$v^2 = \mathbf{v} \cdot \mathbf{v}$. Alternatively, if this is a little slick, we can reach
the same conclusion by considering inertial frames $\tilde{S}$ and $\tilde{S}'$ which are
identical to S and S' except that we measure the x-coordinate in the opposite direction,
meaning $\tilde{x} = -x$ and $\tilde{x}' = -x'$. While S' is moving with velocity $+v$
relative to S, $\tilde{S}'$ is moving with velocity $-v$ with respect to $\tilde{S}$ simply
because we measure things in the opposite direction. That means that

$$\tilde{x}' = \gamma_{-v} (\tilde{x} + v \tilde{t})$$

Substituting $\tilde{x} = -x$, $\tilde{x}' = -x'$ and $\tilde{t} = t$ gives
$x' = \gamma_{-v} (x - vt)$. Comparing this to (7.2), we see that we must have
$\gamma_v = \gamma_{-v}$ as claimed.

We can also look at things from the perspective of S', relative to which the frame S
moves backwards with velocity $-v$. The same argument that led us to (7.2) now tells us
that

$$x = \gamma (x' + vt') \qquad (7.3)$$

Now the function $\gamma = \gamma_{-v}$. But by the argument above, we know that
$\gamma_v = \gamma_{-v}$. In other words, the coefficient $\gamma$ appearing in (7.3) is the
same as that appearing in (7.2).

Figure 44:

At this point, things don't look too different from what we've seen before. Indeed, if
we now insisted on absolute time, so $t = t'$, we're forced to have $\gamma = 1$ and we get
back to the Galilean transformations (7.1). However, as we've seen, this is not compatible
with the second postulate of special relativity. So let's push forward and insist instead
that the speed of light is equal to c in both S and S'. In S, a light ray has trajectory

$$x = ct$$

While, in S', we demand that the same light ray has trajectory

$$x' = ct'$$

Substituting these trajectories into (7.2) and (7.3), we have two equations relating t
and t',

$$ct' = \gamma (c - v) t \quad \text{and} \quad ct = \gamma (c + v) t'$$

Multiplying the two equations together gives $c^2 \, t t' = \gamma^2 (c^2 - v^2) \, t t'$, so
they are compatible only if $\gamma$ is given by

$$\gamma = \sqrt{\frac{1}{1 - v^2/c^2}} \qquad (7.4)$$
We'll be seeing a lot of this coefficient $\gamma$ in what follows. Notice that for
$v \ll c$, we have $\gamma \approx 1$ and the transformation law (7.2) is approximately the
same as the Galilean transformation (7.1). However, as $v \to c$ we have
$\gamma \to \infty$. Furthermore, $\gamma$ becomes imaginary for $v > c$ which means that
we're unable to make sense of inertial frames with relative speed $v > c$.

Equations (7.2) and (7.4) give us the transformation law for the spatial coordinate.
But what about for time? In fact, the temporal transformation law is already lurking in
our analysis above. Substituting the expression for x' in (7.2) into (7.3) and rearranging,
we get

$$t' = \gamma \left( t - \frac{v}{c^2} x \right) \qquad (7.5)$$

We shall soon see that this equation has dramatic consequences. For now, however, we
merely note that when $v \ll c$, we recover the trivial Galilean transformation law
$t' \approx t$. Equations (7.2) and (7.5) are the Lorentz transformations.
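As a numerical sanity check on (7.2), (7.4) and (7.5), the following Python sketch boosts a
light ray $x = ct$ and confirms that it still travels at c in S'. It also boosts back with
$-v$ to recover the original coordinates. The function names and the value $v = 0.6c$ are
illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def gamma(v):
    """Lorentz factor (7.4); only real for |v| < c."""
    if abs(v) >= C:
        raise ValueError("gamma is only real for |v| < c")
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def lorentz_boost(x, t, v):
    """Lorentz transformation (7.2) and (7.5): returns (x', t')."""
    g = gamma(v)
    return g * (x - v * t), g * (t - v * x / C**2)


v = 0.6 * C   # illustrative relative speed
t = 1.0       # seconds
x = C * t     # a point on the light ray x = ct in S

x_prime, t_prime = lorentz_boost(x, t, v)
light_speed_in_s_prime = x_prime / t_prime

# Boosting with -v should undo the transformation.
x_back, t_back = lorentz_boost(x_prime, t_prime, -v)
print(light_speed_in_s_prime / C, x_back / x, t_back / t)
```

All three printed ratios should be 1 up to floating-point rounding, in contrast to the
Galilean result $c - v$.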
7.1.1 Lorentz Transformations in Three Spatial Dimensions

In the above derivation, we ignored the transformation of the coordinates y and z
perpendicular to the relative motion. In fact, these transformations are trivial. Using
the above arguments for linearity and the fact that the origins coincide at $t = 0$, the
most general form of the transformation is

$$y' = \kappa y$$

But, by symmetry, we must also have $y = \kappa y'$. Clearly, we require $\kappa = 1$. (The
other possibility $\kappa = -1$ does not give the identity transformation when $v = 0$.
Instead, it is a reflection).

With this we can write down the final form of the Lorentz transformations. Note
that they look more symmetric between x and t if we write them using the combination
ct,

$$\begin{aligned}
x' &= \gamma \left( x - \frac{v}{c} \, ct \right) \\
y' &= y \\
z' &= z \\
ct' &= \gamma \left( ct - \frac{v}{c} \, x \right)
\end{aligned} \qquad (7.6)$$

where $\gamma$ is given by (7.4). These are also known as Lorentz boosts. Notice that for
$v/c \ll 1$, the Lorentz boosts reduce to the more intuitive Galilean boosts that we saw
in Section 1. (We sometimes say, rather sloppily, that the Lorentz transformations
reduce to the Galilean transformations in the limit $c \to \infty$).

It's also worth stressing again the special properties of these transformations. To be
compatible with the first postulate, the transformations must take the same form if we
invert them to express x and t in terms of x' and t', except with v replaced by $-v$.
And, after a little bit of algebraic magic, they do.

Secondly, we want the speed of light to be the same in all inertial frames. For light
travelling in the x direction, we already imposed this in our derivation of the Lorentz
transformations. But it's simple to check again: in frame S, the trajectory of an object
travelling at the speed of light obeys $x = ct$. In S', the same object will follow the
trajectory $x' = \gamma (x - vt) = \gamma (ct - vx/c) = ct'$.
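The same check can be done numerically for the full boost (7.6). The sketch below, with
assumed helper names `boost_x` and `light_speed_in_s_prime`, sends a light ray out from the
common origin along a chosen unit vector in S and measures its speed in S'. It does this
for a ray along x and, as a further check, for a ray along y.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def boost_x(ct, x, y, z, v):
    """Lorentz boost (7.6) along x: returns (ct', x', y', z')."""
    beta = v / C
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (ct - beta * x), g * (x - beta * ct), y, z


def light_speed_in_s_prime(direction, v, t=1.0):
    """Speed in S' of a light ray that leaves the common origin at t = 0.

    `direction` is a unit vector in S, so the ray sits at c * t * direction.
    The origin event maps to the origin event, so speed = distance / time.
    """
    dx, dy, dz = direction
    ct = C * t
    ct_p, x_p, y_p, z_p = boost_x(ct, dx * ct, dy * ct, dz * ct, v)
    return math.sqrt(x_p**2 + y_p**2 + z_p**2) / (ct_p / C)


v = 0.8 * C  # illustrative relative speed
speed_along_x = light_speed_in_s_prime((1.0, 0.0, 0.0), v)
speed_along_y = light_speed_in_s_prime((0.0, 1.0, 0.0), v)
print(speed_along_x / C, speed_along_y / C)
```

For the ray along y the direction of travel changes in S' (it picks up an x' component),
but the magnitude of its velocity does not.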
What about an object travelling in the y direction at the speed of light? Its trajectory
in S is $y = ct$. From (7.6), its trajectory in S' is $y' = ct'/\gamma$ and $x' = -vt'$. Its
speed in S' is therefore $v'^2 = v_x^2 + v_y^2$, or

$$v'^2 = \left( \frac{x'}{t'} \right)^2 + \left( \frac{y'}{t'} \right)^2
       = v^2 + \frac{c^2}{\gamma^2} = c^2$$

7.1.2 Spacetime Diagrams

We'll find it very useful to introduce a simple spacetime diagram to illustrate the physics
of relativity. In a fixed inertial frame, S, we draw one direction of space, say x,
along the horizontal axis and time on the vertical axis. But things look much nicer if
we rescale time and plot ct on the vertical instead. In the context of special relativity,
space and time together are called Minkowski space. (Although the true definition of
Minkowski space requires some extra structure on space and time which we will meet in
Section 7.3).

This is a spacetime diagram. Each point, P, represents an event. In the following,
we'll label points on the spacetime diagram as coordinates $(ct, x)$, i.e. giving the
coordinate along the vertical axis first. This is backwards from the usual way coordinates
are written but is chosen so that it is consistent with a later, standard, convention that
we will meet in Section 7.3.

A particle moving in spacetime traces out a curve called a worldline as shown in
the figure. Because we've rescaled the time axis, a light ray moving in the x direction
moves at 45°. We'll later see that no object can move faster than the speed of light
which means that the worldlines of particles must always move upwards at an angle
steeper than 45°.

Figure 45: The worldline of a particle

Figure 46: Light rays travel at 45°




The horizontal and vertical axes in the spacetime diagram are the coordinates of the
inertial frame S. But we could also draw the axes corresponding to an inertial frame
S' moving with relative velocity $\mathbf{v} = (v, 0, 0)$. The t' axis sits at $x' = 0$ and
is given by

$$x = vt$$

Meanwhile, the x' axis is determined by $t' = 0$ which, from the Lorentz transformation
(7.6), is given by the equation

$$ct = \frac{v}{c} \, x$$

These two axes are drawn in Figure 47. They can be thought of as the x and ct axes,
rotated by an equal amount towards the diagonal light ray. The fact the axes are
symmetric about the light ray reflects the fact that the speed of light is equal to c in
both frames.
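The claim that both primed axes tilt by the same amount can be checked directly. The
sketch below, in units where $c = 1$ and with hypothetical helper names, maps one point on
the ct' axis and one point on the x' axis back into S using the inverse of (7.6). It then
measures each tilt, from the ct axis and from the x axis respectively.

```python
import math


def to_frame_s(ct_prime, x_prime, beta):
    """Inverse of the boost (7.6), restricted to (ct, x), with beta = v/c."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (ct_prime + beta * x_prime), g * (x_prime + beta * ct_prime)


def axis_tilts_degrees(beta):
    """Tilt of the ct' axis from the ct axis and of the x' axis from the x axis."""
    ct_a, x_a = to_frame_s(1.0, 0.0, beta)  # a point on the ct' axis (x' = 0)
    ct_b, x_b = to_frame_s(0.0, 1.0, beta)  # a point on the x' axis (ct' = 0)
    tilt_time_axis = math.degrees(math.atan2(x_a, ct_a))    # measured from vertical
    tilt_space_axis = math.degrees(math.atan2(ct_b, x_b))   # measured from horizontal
    return tilt_time_axis, tilt_space_axis


tilt_time_axis, tilt_space_axis = axis_tilts_degrees(0.5)  # illustrative v = c/2
print(tilt_time_axis, tilt_space_axis)
```

Both angles equal $\arctan(v/c)$, which stays below 45° for any $v < c$, so the two axes
close in on the light ray symmetrically without ever reaching it.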

Figure 47:

7.1.3 A History of Light Speed

The first evidence that light does not travel instantaneously was presented by the
Danish astronomer Ole Rømer in 1676. He noticed that the periods of the orbits of Io,
the innermost moon of Jupiter, are not constant. When the Earth is moving towards
Jupiter, the orbits are a few minutes shorter; when the Earth moves away, the orbits
are longer by the same amount. Rømer correctly deduced that this was due to the
finite speed of light and gave a rough estimate for the value of c.

By the mid 1800s, the speed of light had been determined fairly accurately using
experiments involving rotating mirrors. Then came a theoretical bombshell. Maxwell
showed that light could be understood as oscillations of the electric and magnetic
fields. He related the speed of light to two constants, $\epsilon_0$ and $\mu_0$, the
permittivity and permeability of free space, that arise in the theory of electromagnetism,

$$c = \sqrt{\frac{1}{\epsilon_0 \mu_0}} \qquad (7.7)$$



But, as we have seen, Newtonian physics tells us that speeds are relative. If Maxwell's
equations predict a value for the speed of light, it was thought that these equations must
be valid only in a preferred reference frame. Moreover, this does not seem unreasonable;
if light is a wave then surely there is something waving. Just as water waves need
water, and sound waves need air, so it was thought that light waves need a material
to propagate in. This material was dubbed the luminiferous ether and it was thought
that Maxwell's equations must only be valid in the frame at rest with respect to this
ether.

In 1887, Michelson and Morley performed an experiment (refining an earlier attempt by
Michelson alone in 1881) to detect the relative motion of the Earth through the ether.
Since the Earth is orbiting the Sun at a speed of $3 \times 10^{4} \ \mathrm{m\,s^{-1}}$,
even if it happens to be stationary with respect to the ether at some point, six months
later this can no longer be the case.

Suppose that at some moment the Earth is moving in the x-direction relative to
the ether with some speed v. The Newtonian addition of velocities tells us that light
propagating in the x-direction should have speed $c + v$ going one way and $c - v$ going
the other. The total time to travel backwards and forwards along a length L should
therefore be

$$T_x = \frac{L}{c + v} + \frac{L}{c - v} = \frac{2cL}{c^2 - v^2}$$

Meanwhile, light making the same journey in the y-direction will have to travel (by
Pythagoras) a total distance of $\sqrt{L^2 + v^2 (T_y/2)^2}$ on each leg of the journey. It
makes this journey at speed c, meaning that we can equate

$$\frac{c T_y}{2} = \sqrt{L^2 + v^2 (T_y/2)^2}
  \quad \Rightarrow \quad T_y = \frac{2L}{\sqrt{c^2 - v^2}}$$

The goal of the Michelson-Morley experiment was to measure the time difference
between $T_y$ and $T_x$ using interference patterns of light rays making the two journeys.
Needless to say, the experiment didn't work: there seemed to be no difference in the
time taken to travel in the x direction and y direction.
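To get a feel for how small the expected effect was, the two ether-theory travel times can
be evaluated numerically. In the sketch below the arm length `L = 11.0` metres is an
assumed, purely illustrative value, and `round_trip_times` is a hypothetical helper name.
The speed `v` is the Earth's orbital speed quoted above.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def round_trip_times(L, v):
    """Ether-theory round-trip times along (T_x) and across (T_y) the motion."""
    t_x = 2.0 * C * L / (C**2 - v**2)
    t_y = 2.0 * L / math.sqrt(C**2 - v**2)
    return t_x, t_y


L = 11.0   # assumed arm length in metres, chosen only for illustration
v = 3.0e4  # orbital speed of the Earth in m/s

t_x, t_y = round_trip_times(L, v)
delta = t_x - t_y

# Expanding both times in powers of v/c gives delta = L v^2 / c^3 at leading order.
leading_order = L * v**2 / C**3
print(t_x, t_y, delta, leading_order)
```

The predicted difference is tiny compared with either travel time, which is why an
interferometer, sensitive to fractions of an optical period, was needed to look for it.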

Towards the end of the 1800s, the null result of the Michelson-Morley experiment
had become one of the major problems in theoretical physics. Several explanations
were proposed, including the idea that the ether was somehow dragged along with the
Earth. The Dutch physicist, Hendrik Lorentz, went some way to finding the correct
solution. He had noticed that Maxwell's equations had the peculiar symmetry that we
now call the Lorentz transformations. He argued that if a reason could be found that
would allow distances between matter to change as

$$x' = \gamma (x - vt)$$

then lengths would be squeezed in the direction parallel to the ether, explaining why
no difference is seen between $T_x$ and $T_y$. (We will shortly derive this contraction of
lengths using special relativity). Lorentz set to work trying to provide a mechanical
explanation for this transformation law.

Figure 48: Simultaneity is relative

Although Lorentz had put in place much of the mathematics, the real insight came
from Einstein in 1905. He understood that there is no mechanical mechanism
underlying the Lorentz transformations. Nor is there an ether. Instead, the Lorentz
transformations are a property of space and time themselves.

With Einstein's new take on the principle of relativity, all problems with Maxwell's
equations evaporate. There is no preferred inertial frame. Instead, Maxwell's equations
work equally well in all inertial frames. However, they are not invariant under the
older transformations of Galilean relativity; instead they are the first law of physics
to be invariant under the correct transformations (7.6) of Einstein/Lorentz relativity.
It's worth pointing out that, from this perspective, we could dispense with the second
postulate of relativity altogether. We need only insist that the laws of physics (which
include Maxwell's equations) hold in all inertial frames. Since Maxwell's equations
predict (7.7), this implies the statement that the speed of light is the same in all inertial
frames. But since we haven't yet seen the relationship between Maxwell's equations,
light and relativity, it's perhaps best to retain the second postulate for now.

7.2 Relativistic Physics 

In this section we will explore some of the more interesting and surprising consequences 
of the Lorentz transformations. 

7.2.1 Simultaneity 

We start with a simple question: how can we be sure that things happen at the same
time? In Newtonian physics, this is a simple question to answer. In that case, we have
an absolute time t and two events, $P_1$ and $P_2$, happen at the same time if $t_1 = t_2$.
However, in the relativistic world, things are not so easy.






We start with an observer in inertial frame S, with time coordinate t. This observer
sensibly decides that two events, $P_1$ and $P_2$, occur simultaneously if $t_1 = t_2$. In the
spacetime diagram on the left of Figure 48 we have drawn lines of simultaneity for this
observer.

But for an observer in the inertial frame S', simultaneity of events occurs for equal
t'. Using the Lorentz transformation, lines of constant t' become lines described by the
equation $t - vx/c^2 = \text{constant}$. These lines are drawn on the spacetime diagram on
the right of Figure 48.

The upshot of this is that two events simultaneous in one inertial frame are not
simultaneous in another. An observer in S thinks that events $P_1$ and $P_2$ happen at the
same time. All other observers disagree.
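A short numerical example makes the size of the disagreement explicit. The sketch below
takes two events that are simultaneous in S but separated by one light-second along x,
and uses the time transformation (7.5) to find when they happen in S'. The function name
`time_in_s_prime` and the values chosen are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def time_in_s_prime(t, x, v):
    """Time coordinate t' of the event (t, x), from (7.5)."""
    g = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    return g * (t - v * x / C**2)


v = 0.6 * C             # illustrative relative speed of S'
separation = C * 1.0    # one light-second, in metres

# Two events that are simultaneous in S (both at t = 0).
t1_prime = time_in_s_prime(0.0, 0.0, v)
t2_prime = time_in_s_prime(0.0, separation, v)
print(t1_prime, t2_prime)
```

In S' the second event has a negative time coordinate, so the observer moving in the
positive x direction says it happened before the first. The gap is $\gamma v \Delta x / c^2$,
which grows with both the relative speed and the spatial separation.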

A Train Story 




Figure 49: Lights on Trains: Simultaneity is Relative 



The fact that all observers cannot agree on what events are simultaneous is a
direct consequence of the fact that all observers do agree on the speed of light. We
can illustrate this connection with a simple gedankenexperiment. (An ugly German
word for "thought experiment", a favourite trick of theoretical physicists who can't be
bothered to do real experiments). Consider a train moving at constant speed, with a
lightbulb hanging from the middle of one of the carriages. A passenger on the train
turns on the bulb and, because the bulb is equidistant from both the front and back
wall of the carriage, observes that the light hits both walls at the same time.

However, a person standing on the platform as the train passes through disagrees.
The light from the bulb travels at equal speed $\pm c$ to the left and right, but the back of
the train is rushing towards the point in space where the light first emerged from. The
person on the platform will see the light hit the back of the train first.

It is worth mentioning that although the two people disagree on whether the light
hits the walls at the same time, this does not mean that they can't be friends.

A Potential Confusion: What the Observer Observes 

We'll pause briefly to press home a point that may lead to confusion. You might 
think that the question of simultaneity has something to do with the finite speed of 
propagation. You don't see something until the light has travelled to you, just as you 
don't hear something until the sound has travelled to you. This is not what's going on 
here! A look at the spacetime diagram in Figure 48 shows that we've already taken 
this into account when deciding whether two events occur simultaneously. The lack of 
simultaneity between moving observers is a much deeper issue, not due to the finiteness 
of the speed of light but rather due to the constancy of the speed of light. 

The confusion about the time of flight of the signal is sometimes compounded by 
the common use of the word observer to mean "inertial frame". This brings to mind
some guy sitting at the origin, surveying all around him. Instead, you should think of 
the observer more as a Big Brother figure: a sea of clocks and rulers throughout the 
inertial frame which can faithfully record and store the position and time of any event, 
to be studied at some time in the future. 

Of course, this means that there is a second question we can ask which is: what 
does the guy sitting at the origin actually see? Now we have to take into account 
both the relative nature of simultaneity and the issues related with the finite speed of 
propagation. This adds an extra layer of complexity which we will discuss in Section 
7.6. 

7.2.2 Causality 

We've seen that different observers disagree on the temporal ordering of two events. 
But where does that leave the idea of causality? Surely it's important that we can say 
that one event definitely occurred before another. Thankfully, all is not lost: there are 
only some events which observers can disagree about. 

To see this, note that because Lorentz boosts are only possible for $v < c$, the lines of
simultaneity cannot be steeper than 45°. Take a point P and draw the 45° light rays
that emerge from P. This is called the light cone. (For once, in the figure, I've drawn
this with an extra spatial dimension present to illustrate how this works in spatial
dimensions bigger than one). The light cone is really two cones, touching at the point
P. They are known as the future light cone and past light cone.

For events inside the light cone of P, there is no difficulty deciding on the temporal
ordering of events. All observers will agree that Q occurred after P. However, for events
outside the light cone, the matter is up for grabs: some observers will see R as happening
after P; some before.

Figure 50:

This tells us that the events which all observers agree can be causally influenced by P
are those inside the future light cone. Similarly, the events which can plausibly influence
P are those inside the past light cone. This means that we can sleep comfortably at
night, happy in the knowledge that causality is preserved, only if nothing can propagate
outside the light cone. But that's the same thing as travelling faster than the speed of
light.
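The difference between events inside and outside the light cone can be seen by scanning
over boosts. The sketch below works in units where $c = 1$, puts P at the origin, and picks
two illustrative events: Q at $(ct, x) = (2, 1)$, inside the future light cone, and R at
$(ct, x) = (1, 2)$, outside it. The helper names and the sampled velocities are assumptions.

```python
import math


def boosted_time(ct, x, beta):
    """ct' of the event (ct, x) after a boost with velocity beta = v/c, from (7.6)."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (ct - beta * x)


def range_of_boosted_times(ct, x, betas):
    """Smallest and largest ct' seen over the sampled boosts."""
    values = [boosted_time(ct, x, b) for b in betas]
    return min(values), max(values)


betas = [i / 100.0 for i in range(-99, 100)]  # sampled velocities with |v| < c

# P is the origin. Q lies inside its future light cone, R lies outside the cone.
q_min, q_max = range_of_boosted_times(2.0, 1.0, betas)
r_min, r_max = range_of_boosted_times(1.0, 2.0, betas)
print(q_min, q_max)
print(r_min, r_max)
```

Every sampled observer assigns Q a positive time, so Q is always after P. For R the sign
of ct' changes across the sample, so some observers place it after P and some before.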

The converse to this is that if we do ever see particles that travel faster than the 
speed of light, we're in trouble. We could use them to transmit information faster than 
light. But another observer would view this as transmitting information backwards in 
time. All our ideas of cause and effect will be turned on their head. You will therefore 
be relieved to learn that we will show in Section 7.3 why it is impossible to accelerate 
particles past the light speed barrier. 

There is a corollary to the statement that events outside the light cone cannot
influence each other: there are no perfectly rigid objects. Suppose that you push on one
end of a rod. The other end cannot move immediately since that would allow us to
communicate faster than the speed of light. Of course, for real rods, the other end does
not move instantaneously. Instead, pushing on one end of the rod initiates a sound
wave which propagates through the rod, telling the other parts to move. The
statement that there is no rigid object is simply the statement that this sound wave must
travel slower than the speed of light.

Finally, let me mention that when we're talking about waves, as opposed to point 
particles, there is a slight subtlety in exactly what must travel slower than light. There 
are at least two velocities associated to a wave: the group velocity is (usually) the speed 
at which information can be communicated. This is less than c. In contrast, the phase 
velocity is the speed at which the peaks of the wave travel. This can be greater than 
c, but transmits no information. 

7.2.3 Time Dilation 

We'll now turn to one of the more dramatic results of special relativity. Consider a
clock sitting stationary in the frame S' which ticks at intervals of T'. This means that
the tick events in frame S' occur at $(ct_1', 0)$ then $(ct_1' + cT', 0)$ and so on. What are
the intervals between ticks in frame S?

We can answer immediately from the Lorentz transformations (7.6). Inverting this
gives

$$t = \gamma \left( t' + \frac{v x'}{c^2} \right)$$

The clock sits at $x' = 0$, so we immediately learn that in frame S, the interval between
ticks is

$$T = \gamma T'$$

This means that the gap between ticks is longer in the stationary frame. A moving
clock runs more slowly. But the same argument holds for any process, be it clocks,
elementary particles or human hearts. The correct interpretation is that time itself
runs more slowly in moving frames.
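The argument above can be replayed numerically: take the ticks of a clock at rest at
$x' = 0$ in S', map them into S with the inverse transformation, and read off the interval.
This is a minimal sketch, with the helper name `tick_times_in_s` and the speed $v = 0.8c$
chosen for illustration.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def tick_times_in_s(t_prime_ticks, v):
    """Times in S of the ticks of a clock at rest at x' = 0 in S'.

    Uses the inverse Lorentz transformation t = gamma * (t' + v x' / c^2).
    """
    g = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
    x_prime = 0.0  # the clock does not move in S'
    return [g * (tp + v * x_prime / C**2) for tp in t_prime_ticks]


v = 0.8 * C     # illustrative speed of the clock relative to S
T_prime = 1.0   # tick interval in the clock's rest frame, in seconds

ticks_in_s = tick_times_in_s([0.0, T_prime, 2.0 * T_prime], v)
T = ticks_in_s[1] - ticks_in_s[0]
print(T / T_prime)
```

The printed ratio is the Lorentz factor $\gamma$ for the chosen speed, in line with
$T = \gamma T'$.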

Another Train Story

Figure 51: More Lights on Trains: Time Dilation

Let's go back to our lightbulb and gedankenbahn. If the train has height h, a
passenger on the train will measure time $t' = h/c$ for the light to travel from the
light bulb to the middle of the floor (i.e. the point directly below the light bulb).
What about for the guy on the platform? After the light turns on, the train has moved
forward at speed v. To hit the same point on the floor, the light has to travel a distance
$\sqrt{h^2 + (vt)^2}$. The time taken is therefore

$$t = \frac{\sqrt{h^2 + (vt)^2}}{c} = \frac{\sqrt{c^2 t'^2 + v^2 t^2}}{c}
  \quad \Rightarrow \quad t = \gamma t'$$

This gives another, more pictorial, derivation of the time dilation formula.

On Muons and Planes

Away from the world of gedankenexperiments, there are a couple of real experimental
consequences of time dilation. Certainly the place that this phenomenon is tested most
accurately is in particle accelerators where elementary particles routinely reach speeds
close to c. The protons spinning around the LHC have $\gamma \approx 3500$. The previous
collider in CERN, called LEP, accelerated electrons and positrons to
$\gamma \approx 2 \times 10^{5}$. (Although the electrons in LEP were travelling faster than
the protons in LHC, the greater mass of the protons means that there is substantially more
energy in the LHC collisions).

The effect of time dilation is particularly vivid on unstable particles which live much
longer in the lab frame than in their own rest frame. An early demonstration was seen
in muons in 1941. These are heavier, unstable, versions of the electron. They decay
into an electron, together with a couple of neutrinos, with a half-life of
$\tau \approx 2 \times 10^{-6}$ s. Muons are created when cosmic rays hit the atmosphere, and
subsequently rain down on Earth. Yet to make it down to sea level, it takes about
$t = 7 \times 10^{-5}$ s, considerably longer than their lifetime. Given this, why are there
any muons detected on Earth at all? Surely they should have decayed. The reason that they
do not is because the muons are travelling at a speed $v \approx 0.99c$, giving
$\gamma \sim 10$. From the muon's perspective, the journey only takes
$t' = t/\gamma \approx 7 \times 10^{-6}$ s, which is just a few lifetimes rather than a few
dozen, so an appreciable fraction of them survive.
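The muon numbers can be put through a quick calculation. The sketch below uses the rounded
values quoted above and treats $\tau$ as a half-life, as the text does; the helper names are
assumptions. Note that for $v = 0.99c$ exactly the Lorentz factor comes out nearer 7 than
10. The round figure $\gamma \sim 10$ corresponds to a slightly higher speed, roughly
$0.995c$.

```python
import math


def gamma_from_beta(beta):
    """Lorentz factor (7.4) written in terms of beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta**2)


def surviving_fraction(elapsed, half_life):
    """Fraction of an unstable population left after `elapsed` seconds."""
    return 0.5 ** (elapsed / half_life)


half_life = 2e-6  # s, rounded value quoted in the text
t_lab = 7e-5      # s, travel time to sea level in the Earth's frame

g = gamma_from_beta(0.99)
t_muon = t_lab / g  # travel time in the muon's rest frame

fraction_without_dilation = surviving_fraction(t_lab, half_life)
fraction_with_dilation = surviving_fraction(t_muon, half_life)
print(g, t_muon, fraction_without_dilation, fraction_with_dilation)
```

Without time dilation essentially no muons would reach the ground. With it, a few per cent
do, which is enough to detect them easily.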

Note that elementary particles are, by definition, structureless. They're certainly 
not some clock with an internal machinery. The reason that they live longer can't be 
explained because of some mechanical device which slows down: it is time itself which 
is running slower. 

A more direct test of time dilation was performed in 1971 by Hafele and Keating.
They flew two atomic clocks around the world on commercial airliners; two more were
left at home. When they were subsequently brought together, their times differed by
about $10^{-7}$ s. There are actually two contributions to this effect: the time dilation
of special relativity that we've seen above, together with a related effect in general
relativity due to the gravity of the Earth.

Twin Paradox 

Two twins, Luke and Leia, decide to spend some time apart. Leia stays at home while
Luke jumps in a spaceship and heads at some speed v to the planet Tatooine. With
sadness, Leia watches Luke leave but is relieved to see (only a time T later from her
perspective) him safely reach the planet.

However, upon arrival, Luke finds that he doesn't like Tatooine so much. It is a
dusty, violent place with little to do. So he turns around and heads back to Leia at the
same speed v as before. When he returns, he finds that Leia has aged by
$T_{\rm Leia} = 2T$. And yet, fresh faced Luke has only aged by $T_{\rm Luke} = 2T/\gamma$.
We see that, after the journey, Luke is younger than Leia. In fact, for large enough values
of $\gamma$, Luke could return to find Leia long dead.

This is nothing more than the usual time dilation story. So why is it a paradox?
Well, things seem puzzling from Luke's perspective. He's sitting happily in his inertial
spaceship, watching Leia and the whole planet flying off into space at speed v. From
his perspective, it should be Leia who is younger. Surely things should be symmetric
between the two?

The resolution to this "paradox" is that there is no symmetry between Luke's journey 
and Leia's. Leia remained in an inertial frame for all time. Luke, however, does not. 
When he reaches Tatooine, he has to turn around and this event means that he has to 
accelerate. This is what breaks the symmetry. 



We can look at this in some detail. We draw the spacetime diagram in Leia's frame.
Luke sits at $x = vt$, or $x' = 0$. Leia sits at $x = 0$. Luke reaches Tatooine at point P.
We've also drawn two lines of simultaneity. The point Y is when Leia thinks that Luke has
arrived on Tatooine. The point X is where Luke thinks Leia was when he arrived at Tatooine.
As we've already seen, it's quite ok for Luke and Leia to disagree on the simultaneity of
these points. Let's figure out the coordinates for X and Y.

Figure 52: (spacetime diagram in Leia's frame; vertical axis ct, Luke's worldline, P = Arrival)

Event Y sits at coordinate (cT, 0) in Leia's frame, while P is at {cT,vT). The time 
elapsed in Luke's frame is just the usual time dilation calculation. 



T 

7 



We can also work out the coordinates of the event X. Clearly this takes place at x — 
in Leia's frame. In Luke's frame, this is simultaneous with his arrival at Tatooine, so 
occurs at t' — T' — T/^. We can again use the Lorentz transformation 



2 

V X 
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now viewed as an equation for t given x and t'. This gives us 

^ _T' _T 
7 7^ 

So at this point, we see that everything is indeed symmetric. When Luke reaches 
Tatooine, he thinks that Leia is younger than him by a factor of 7. Meanwhile, Leia 
thinks that Luke is younger than her by the same factor . 

Things change when Luke turns around. To illustrate this, let's first consider a 
different scenario where he doesn't return from Tatooine. Instead, as soon as he arrives, 
he synchronises his clock with a friend - let's call him Han - who is on his way to meet 
Leia. Now things are still symmetric. Luke thinks that Leia has aged by $T/\gamma^2$ on the 
outward journey; Han also thinks that Leia has aged by $T/\gamma^2$ on the inward journey. 
So where did the missing time go? 

We can see this by looking at the spacetime diagram of 
Han's journey. We've again drawn lines of simultaneity. From 
Han's perspective, he thinks that Leia is sitting at point Z 
when he leaves Tatooine, while Luke is still convinced that 
she's sitting at point X. It's not hard to check that at point 
Z, Leia's clock reads $t = 2T - T/\gamma^2$. 

From this perspective, we can also see what happens if Luke 
does return home. When he arrives at Tatooine, he thinks 
Leia is at point X. Yet, in the time he takes to turn around 
and head home, the acceleration makes her appear to rapidly 
age, from point X to point Z. 

Figure 53: 
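
The clock readings quoted above are easy to check with numbers. The following is a minimal sketch, not part of the original notes: the function names, the choice of units with c = 1, and the sample values v = 0.8 and T = 5 are all assumptions made for illustration.

```python
import math

def gamma(v, c=1.0):
    """Lorentz factor for speed v (same units as c)."""
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)

def twin_clock_readings(v, T, c=1.0):
    """Clock readings for the outward leg, with T measured in Leia's frame.

    Returns (Luke's elapsed time at P, Leia's clock at X, Leia's clock at Z).
    """
    g = gamma(v, c)
    t_luke = T / g             # time dilation: T' = T / gamma
    t_X = T / g ** 2           # simultaneous with P according to Luke
    t_Z = 2 * T - T / g ** 2   # simultaneous with P according to Han
    return t_luke, t_X, t_Z

# Illustrative values (assumed): v = 0.8c, Leia waits T = 5 years per leg
t_luke, t_X, t_Z = twin_clock_readings(v=0.8, T=5.0)
print("Luke's clock on arrival:", t_luke)
print("Leia's clock at X and Z:", t_X, t_Z)
print("Leia's apparent ageing during the turnaround:", t_Z - t_X)
```

The three pieces of Leia's worldline (origin to X, X to Z, Z to the reunion) add up to her total time 2T, which is where the "missing time" is accounted for.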

7.2.4 Length Contraction 

We've seen that moving clocks run slow. We will now show that moving rods are 
shortened. Consider a rod of length L' sitting stationary in the frame S'. What is its 
length in frame S? 

To begin, we should state more carefully something which seems obvious: when we 
say that a rod has length L', it means that the distance between the two end points 
at equal times is L'. So, drawing the axes for the frame S', the situation looks like 
the picture on the left. The two, simultaneous, end points in S' are $P_1$ and $P_2$. Their 
coordinates in S' are (ct', x') = (0, 0) and (0, L') respectively. 

Now let's look at this in frame S. This is drawn in the right-hand picture. Clearly $P_1$ 
sits at (ct, x) = (0, 0). Meanwhile, the Lorentz transformation gives us the coordinate 
for $P_2$, 

$$x = \gamma L' \quad \text{and} \quad t = \frac{\gamma v L'}{c^2}$$

But to measure the rod in frame S, we want both ends to be at the same time. And 
the points $P_1$ and $P_2$ are not simultaneous in S. We can follow the point $P_2$ backwards 
along the trajectory of the end point to $Q_2$, which sits at 

$$x = \gamma L' - vt$$

We want $Q_2$ to be simultaneous with $P_1$ in frame S. This means we must move back 
a time $t = \gamma v L'/c^2$, giving 

$$x = \gamma L' - \frac{\gamma v^2 L'}{c^2} = \frac{L'}{\gamma}$$

This is telling us that the length L measured in frame S is 

$$L = \frac{L'}{\gamma}$$

It is shorter than the length of the rod in its rest frame by a factor of $\gamma$. This 
phenomenon is known as Lorentz contraction. 
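
The argument above can be replayed numerically: transform the far end of the rod into S, then slide back along its worldline until it is simultaneous with the near end. This is only a sketch under assumed conventions (units with c = 1; the helper names `to_S` and `measured_length` are made up for this example).

```python
import math

def gamma(v, c=1.0):
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)

def to_S(t_prime, x_prime, v, c=1.0):
    """Inverse Lorentz transformation: event (t', x') in S' -> (t, x) in S."""
    g = gamma(v, c)
    return g * (t_prime + v * x_prime / c ** 2), g * (x_prime + v * t_prime)

def measured_length(L_rest, v, c=1.0):
    """Length in S of a rod at rest in S' with ends at x' = 0 and x' = L_rest."""
    t2, x2 = to_S(0.0, L_rest, v, c)   # the event P2
    # The far end moves at speed v in S, so follow it back to t = 0 (the event Q2)
    return x2 - v * t2

# Illustrative value (assumed): v = 0.6c, so gamma = 1.25
print(measured_length(1.0, 0.6), 1.0 / gamma(0.6))
```

The two printed numbers agree, reproducing L = L'/γ without using the contraction formula directly.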

Putting Ladders in Barns 

Take a ladder of length 2L and try to put it in a barn of length L. If you run fast enough, 
can you squeeze it? Here are two arguments, each giving the opposite conclusion: 

• From the perspective of the barn, the ladder contracts to a length $2L/\gamma$. This 
shows that it can happily fit inside as long as you run fast enough, with $\gamma > 2$ 

• From the perspective of the ladder, the barn has contracted to length $L/\gamma$. This 
means there's no way you're going to get the ladder inside the barn. Running 
faster will only make things worse 

What's going on? As usual, to reconcile these two points of view we need to think more 
carefully about the question we're asking. What does it mean to "fit a ladder inside 
a barn"? Any observer will agree that we've achieved this if the back end gets in the 
door before the front end hits the far wall. But we know that simultaneity of events 
is not fixed, so the word "before" in this definition suggests that it may be something 
different observers will disagree on. Let's see how this works. 



The spacetime diagram in the frame of the barn is 
drawn in the figure with $\gamma > 2$. We see that, from the 
barn's perspective, both back and front ends of the ladder 
are happily inside the barn at the same time. We've also 
drawn the line of simultaneity for the ladder's frame. This 
shows that when the front of the ladder hits the far wall, the 
back end of the ladder has not yet got in the door. Is the 
ladder in the barn? Well, it all depends who you ask. 
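
The disagreement can be made concrete by computing the times of the two key events in each frame. The sketch below is illustrative only: it assumes units with c = 1, places the door at x = 0 and the far wall at x = L in the barn frame, and lets the back of the ladder pass the door at t = 0. The function name `ladder_events` is hypothetical.

```python
import math

def ladder_events(L, v, c=1.0):
    """Times of 'back end enters' and 'front end hits wall' in both frames."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    # Barn frame: back of ladder follows x = v t, front is a distance 2L/gamma ahead
    t_back_in = 0.0
    t_front_hits = (L - 2 * L / g) / v
    # Ladder frame, via t' = gamma (t - v x / c^2)
    tp_back_in = g * (t_back_in - v * 0.0 / c ** 2)
    tp_front_hits = g * (t_front_hits - v * L / c ** 2)
    return (t_back_in, t_front_hits), (tp_back_in, tp_front_hits)

# Illustrative values (assumed): L = 1, v = 0.9c, for which gamma > 2
barn, ladder = ladder_events(L=1.0, v=0.9)
print("barn frame  : back in at", barn[0], ", front hits wall at", barn[1])
print("ladder frame: back in at", ladder[0], ", front hits wall at", ladder[1])
```

In the barn frame the back end enters first; in the ladder frame the front end reaches the wall first. The ordering flips because the two events are spacelike separated.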

Figure 55: (spacetime diagram in the frame of the barn; labels: ct axis, door, wall)

7.2.5 Addition of Velocities 


A particle moves with constant velocity u' in frame S' which, in turn, moves with 
constant velocity v with respect to frame S. What is the velocity u of the particle as 
seen in S? 

The Newtonian answer is just u = u' + v. But we know that this can't be correct 
because it doesn't give the right answer when u' = c. So what is the right answer? 

The worldline of the particle in S' is 

$$x' = u't' \qquad (7.8)$$

So the velocity of the particle in frame S is given by 

$$u = \frac{x}{t} = \frac{\gamma(x' + vt')}{\gamma(t' + vx'/c^2)}$$

which follows from the Lorentz transformations (7.6). (Actually, we've used the inverse 
Lorentz transformations since we want S coordinates in terms of S' coordinates, but 
these differ only by changing -v to v). Substituting (7.8) into the expression above, and 
performing a little algebra, gives us the result we want: 

$$u = \frac{u' + v}{1 + u'v/c^2} \qquad (7.9)$$

Note that when u' = c, this gives us u = c as expected. 

We can also show that if |u'| < c and |v| < c then we necessarily have -c < u < c. 
The proof is simple algebra, if a little fiddly 

$$c - u = c - \frac{u' + v}{1 + u'v/c^2} = \frac{c(c - u')(c - v)}{c^2 + u'v} > 0$$

where the last inequality follows because, by our initial assumptions, each factor in the 
final expression is positive. An identical calculation will show you that -c < u as well. 
We learn that if a particle is travelling slower than the speed of light in one inertial 
frame, it will also be travelling slower than light in all others. 

7.3 The Geometry of Spacetime 

The views of space and time which I wish to lay before you have sprung from 
the soil of experimental physics, and therein lies their strength. They are 
radical. Henceforth space by itself, and time by itself, are doomed to fade 
away into mere shadows, and only a kind of union of the two will preserve 
an independent reality. 

Hermann Minkowski, 1908 

We have seen that time is relative, length is relative, simultaneity is relative. Is 
nothing sacred anymore? Well, the answer is yes: there is one measurement that all 
observers will agree on. 

7.3.1 The Invariant Interval 

Let's start by considering a spacetime with just a single spatial coordinate, x. In 
frame S, two events $P_1$ and $P_2$ have coordinates $(ct_1, x_1)$ and $(ct_2, x_2)$. The events are 
separated by $\Delta t = t_1 - t_2$ in time and $\Delta x = x_1 - x_2$ in space. 

We define the invariant interval $\Delta s^2$ as a measure of the distance between these two 
points: 

$$\Delta s^2 = c^2 \Delta t^2 - \Delta x^2$$

The advantage of the invariant interval is that it is something all observers agree upon. 
In frame S', we have 

$$\begin{aligned} \Delta s^2 = c^2 \Delta t'^2 - \Delta x'^2 &= \gamma^2 \left( c\Delta t - \frac{v \Delta x}{c} \right)^2 - \gamma^2 \left( \Delta x - v \Delta t \right)^2 \\ &= c^2 \Delta t^2 - \Delta x^2 \end{aligned} \qquad (7.10)$$

where, in going from the first line to the second, we see that the cross-terms $\Delta t\,\Delta x$ 
cancel out. 

Including all three spatial dimensions, the definition of the invariant interval is 

$$\Delta s^2 = c^2 \Delta t^2 - \Delta x^2 - \Delta y^2 - \Delta z^2 \qquad (7.11)$$

which, again, is the same in all frames. (The only non-trivial part of the calculation is 
(7.10) above since y and z are invariant under a boost in the x direction). 

The spacetime of special relativity is topologically $\mathbb{R}^4$. When endowed with the 
measure of distance (7.11), this spacetime is referred to as Minkowski space. Although 
topologically equivalent to Euclidean space, distances are measured differently. To 
stress the difference between the time and spatial directions, Minkowski space is 
sometimes said to have dimension d = 1 + 3. (For once, it's important that you don't do 
this sum!). 

In later courses - in particular General Relativity - you will see the invariant 
interval written as the distance between two infinitesimally close points. In practice 
that just means we replace all the $\Delta$(something)s with d(something)s. 

$$ds^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2$$

In this infinitesimal form, $ds^2$ is called the line element. 

The invariant interval provides an observer-independent characterisation of the 
distance between any two events. However, it has a strange property: it is not positive 
definite. Two events whose separation is $\Delta s^2 > 0$ are said to be timelike separated. 
They are closer together in space than they are in time. Pictorially, such events sit 
within each other's light cone. 

In contrast, events with $\Delta s^2 < 0$ are said to be spacelike separated. They sit outside 
each other's light cone. From our discussion in Section 7.2.2, we know that two observers 
can disagree about the temporal ordering of spacelike separated events. However, 
they agree on the ordering of timelike separated events. Note that since $\Delta s^2 < 0$ for 
spacelike separated events, if you insist on talking about $\Delta s$ itself then it must be 
purely imaginary. However, usually it will be perfectly fine if we just talk about $\Delta s^2$. 

Finally, two events with $\Delta s^2 = 0$ are said to be lightlike separated. Notice that this 
is an important difference between the invariant interval and most measures of distance 
that you're used to. Usually, if two points are separated by zero distance, then they are 
the same point. This is not true in Minkowski spacetime: if two points are separated 
by zero distance, it means that they can be connected by a light ray. 
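
A short numerical check may help make the invariance and the three kinds of separation concrete. This is a sketch under assumptions not in the notes: units with c = 1, a boost along x only, and helper names (`boost`, `interval2`, `classify`) invented for the example. Because the Lorentz transformation is linear, the separations Δt and Δx transform in the same way as the coordinates themselves.

```python
import math

def boost(t, x, v, c=1.0):
    """Lorentz boost along x: (t, x) in S -> (t', x') in S'."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * (t - v * x / c ** 2), g * (x - v * t)

def interval2(dt, dx, dy=0.0, dz=0.0, c=1.0):
    """Invariant interval: c^2 dt^2 - dx^2 - dy^2 - dz^2."""
    return (c * dt) ** 2 - dx ** 2 - dy ** 2 - dz ** 2

def classify(ds2, tol=1e-12):
    if ds2 > tol:
        return "timelike"
    if ds2 < -tol:
        return "spacelike"
    return "lightlike"

# Three sample separations (dt, dx), viewed from a frame moving at v = 0.6c
for dt, dx in [(2.0, 1.0), (1.0, 2.0), (1.0, 1.0)]:
    dtp, dxp = boost(dt, dx, v=0.6)
    print(classify(interval2(dt, dx)), interval2(dt, dx), interval2(dtp, dxp))
```

For the spacelike pair, a sufficiently fast boost also flips the sign of Δt', which is the disagreement about temporal ordering mentioned above.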

A Rotational Analogy 

There's a simple analogy to understand the meaning of the invariant interval. Let's go 
back to consider three dimensional Euclidean space with coordinates x = (x, y, z). An 
observer measures the position of a stationary object — let's say a helicopter — and 
proudly announces the x and y and z coordinates of the helicopter. 

Meanwhile, a second observer shares the same origin as the first, but has rotated his 
axes to use coordinates x' = (x', y', z') where x' = Rx for some rotation matrix R. He 
too sees the helicopter, and declares that it sits at coordinates x', y' and z'. 

Of course, there's no reason why the coordinates of the two observers should agree 
with each other. However, there is one quantity that should be invariant: the distance 
from the origin (which is shared by both observers) to the helicopter. In other words, 
we should find that 

$$s^2_{\rm Euclidean} = x^2 + y^2 + z^2 = x'^2 + y'^2 + z'^2 \qquad (7.12)$$

And, of course, this is indeed true if the rotation matrix obeys $R^T R = 1$. 

The essence of special relativity is nothing more than an extrapolation of the 
discussion above. The Lorentz boosts should be thought of as a rotation between 
space and time. The individual spatial and temporal coordinates are different for the 
two observers, but there remains an invariant distance. The only thing that's different 
is that the time and space directions in this invariant distance (7.11) come with 
different minus signs. We'll now explore this relationship between boosts and rotations in 
some detail. 

7.3.2 The Lorentz Group 

We have defined the interval (7.11) as the measure of distance which is invariant under 
Lorentz transformations. However, it is actually better to look at things the other way: 
the invariant interval is the primary object. This is a property of spacetime which 
defines the Lorentz transformations. Let's see how the argument runs this way around. 

If we sit at the origin in a fixed frame S, the coordinates of an event can be written 
as a four vector X. We won't denote that this is a vector by bold font or squiggly 
underlines (which we're really saving for three-dimensional spatial vectors). We're just 
getting sophisticated now and just the capital letter will have to suffice. However, we 
will sometimes use index notation, in which the components of the 4-vector are 

$$X^\mu = (ct, x, y, z), \qquad \mu = 0, 1, 2, 3$$

Note that we write the indices running from $\mu = 0$ to $\mu = 3$ rather than starting at 1. 
The zeroth component of the vector is time (multiplied by c). The invariant distance 
between the origin and the point P can be written as an inner product, defined as 

$$X \cdot X = X^T \eta X = X^\mu \eta_{\mu\nu} X^\nu \qquad (7.13)$$

In the first expression above we are using matrix-vector notation and in the second we 
have resorted to index notation, with the summation convention for both indices $\mu$ and 
$\nu$. The matrix $\eta$ is given by 

$$\eta = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$$

This matrix is called the Minkowski metric. With this expression for the Minkowski 
metric, the inner product becomes 

$$X \cdot X = c^2 t^2 - x^2 - y^2 - z^2$$

which is indeed the invariant distance (7.11) between the origin and the point X as 
promised. 

Following our characterisation of distances using the invariant interval, a four vector 
obeying $X \cdot X > 0$ is said to be timelike; one with $X \cdot X < 0$ is said to be spacelike; 
and one with $X \cdot X = 0$ is said to be lightlike or, alternatively, null. 

The Lorentz transformation can be thought of as a 4 x 4 matrix $\Lambda$, rotating the 
coordinates in frame S to coordinates in frame S', such that the four vector becomes 

$$X' = \Lambda X$$

This can also be written in index notation as $X'^\mu = \Lambda^\mu{}_\nu X^\nu$. The Lorentz transformations 
are defined to be those matrices which leave the inner product invariant. This means 
that 

$$X' \cdot X' = X \cdot X$$

From our definition (7.13), we see that this is true only if $\Lambda$ obeys the matrix equation 

$$\Lambda^T \eta \Lambda = \eta \qquad (7.14)$$

Let's try to understand the solutions to this. We can start by counting how many we 
expect. The matrix $\Lambda$ has 4 x 4 = 16 components. Both sides of equation (7.14) are 
symmetric matrices, which means that the equation only provides 10 constraints on 
the coefficients of $\Lambda$. We therefore expect to find 16 - 10 = 6 independent solutions. 

The solutions to (7.14) fall into two classes. The first class is very familiar. Let's 
look at solutions of the form 



$$\Lambda = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & & & \\ 0 & & R & \\ 0 & & & \end{pmatrix} \qquad (7.15)$$

where R is a 3 x 3 matrix. These transformations change space, but leave time intact. 
The condition (7.14) reduces to a condition for the matrix R, 

$$R^T R = 1$$

where the right-hand side is understood to be the 3 x 3 unit matrix. But this is 
something that we've seen before: it is the requirement for R to be a rotation matrix. 
There are three such independent matrices, corresponding to rotations about the three 
different spatial axes. 

The remaining three solutions to (7.14) are the Lorentz boosts that have preoccupied 
us for much of this Section. The boost along the x axis is given by 

$$\Lambda = \begin{pmatrix} \gamma & -\gamma v/c & 0 & 0 \\ -\gamma v/c & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \qquad (7.16)$$

These are precisely the Lorentz transformations (7.6). Two further solutions to (7.14) 
come from boosting along the y and z directions. 

The set of all matrices $\Lambda$ obeying (7.14) forms the Lorentz group, denoted O(1,3). You 
can easily check that they indeed obey all axioms of a group. Taking the determinant 
of both sides of (7.14), we see that $(\det\Lambda)^2 = 1$, so the Lorentz group splits into two 
pieces with $\det\Lambda = \pm 1$. The subgroup with $\det\Lambda = 1$ is called the proper Lorentz 
group and is denoted SO(1,3). 

There is one further decomposition of the Lorentz group. Any element can either flip 
the direction of time or leave it invariant. Those transformations which preserve the 
direction of time are called orthochronous. The group of proper orthochronous Lorentz 
transformations is denoted $SO^+(1,3)$ although people like me are usually lazy and just 
refer to it as SO(1,3). 
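
The defining condition (7.14) is simple to test on explicit matrices. The sketch below uses plain Python lists rather than a matrix library; the helper names, the parameter beta = v/c, and the sample values are assumptions for illustration. The orthochronous test uses the sign of the time-time component, which is the standard criterion for a Lorentz matrix.

```python
import math

ETA = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(row) for row in zip(*A)]

def boost_x(beta):
    """Boost (7.16) along x, with beta = v/c."""
    g = 1.0 / math.sqrt(1.0 - beta ** 2)
    return [[g, -g * beta, 0, 0], [-g * beta, g, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

def rotation_z(theta):
    """Rotation about the z axis, embedded as in (7.15)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1, 0, 0, 0], [0, c, s, 0], [0, -s, c, 0], [0, 0, 0, 1]]

def is_lorentz(L, tol=1e-12):
    """Check Lambda^T eta Lambda = eta entry by entry."""
    lhs = matmul(transpose(L), matmul(ETA, L))
    return all(abs(lhs[i][j] - ETA[i][j]) < tol for i in range(4) for j in range(4))

def is_orthochronous(L):
    """A Lorentz matrix preserves the direction of time when L[0][0] > 0."""
    return L[0][0] > 0

time_reversal = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
product = matmul(boost_x(0.6), rotation_z(0.3))
print(is_lorentz(boost_x(0.6)), is_lorentz(rotation_z(0.3)), is_lorentz(product))
print(is_lorentz(time_reversal), is_orthochronous(time_reversal))
```

The product of a boost and a rotation passes the same test, which is the closure property of the group in one example.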

Rapidity 

We previously derived the velocity addition law (7.9). Let's see how we get this from 
the matrix approach above. We can focus on the 2 x 2 upper-left hand part of the 
matrix in (7.16). We'll write this as 

$$\Lambda[v] = \begin{pmatrix} \gamma & -\gamma v/c \\ -\gamma v/c & \gamma \end{pmatrix}$$

If we combine two boosts, both in the x direction, the resulting Lorentz transformation 
is 

$$\Lambda[v_1]\Lambda[v_2] = \begin{pmatrix} \gamma_1 & -\gamma_1 v_1/c \\ -\gamma_1 v_1/c & \gamma_1 \end{pmatrix} \begin{pmatrix} \gamma_2 & -\gamma_2 v_2/c \\ -\gamma_2 v_2/c & \gamma_2 \end{pmatrix}$$

It takes a little bit of algebra, but multiplying out these matrices you can show that 

$$\Lambda[v_1]\Lambda[v_2] = \Lambda\left[\frac{v_1 + v_2}{1 + v_1 v_2/c^2}\right]$$

which is again the velocity addition rule (7.9), now for the composition of boosts. 

The algebra involved in the above calculation is somewhat tedious; the result somewhat 
ugly. Is there a better way to see how this works? We can get a clue from the 
rotation matrices R. Recall that the 2 x 2 matrix which rotates a plane by angle $\theta$ is 

$$R[\theta] = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}$$

If we perform two rotations in succession, we have 

$$R[\theta_1]R[\theta_2] = R[\theta_1 + \theta_2]$$
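
Both composition rules can be checked numerically by multiplying the 2 x 2 matrices directly. This is an illustrative sketch: the helper names and the parameter beta = v/c are assumptions, and the velocity is read back off the product as minus the ratio of the off-diagonal to the diagonal entry.

```python
import math

def boost2(beta):
    """2x2 boost block Lambda[v], with beta = v/c."""
    g = 1.0 / math.sqrt(1.0 - beta ** 2)
    return [[g, -g * beta], [-g * beta, g]]

def rot2(theta):
    """2x2 rotation matrix R[theta]."""
    return [[math.cos(theta), math.sin(theta)], [-math.sin(theta), math.cos(theta)]]

def matmul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def beta_of(L):
    """Recover v/c from a boost block: L[0][0] = gamma, L[0][1] = -gamma * beta."""
    return -L[0][1] / L[0][0]

b1, b2 = 0.5, 0.7                      # sample speeds in units of c (assumed)
combined = matmul2(boost2(b1), boost2(b2))
print(beta_of(combined), (b1 + b2) / (1 + b1 * b2))

a1, a2 = 0.4, 1.1                      # sample angles in radians (assumed)
print(matmul2(rot2(a1), rot2(a2)))
print(rot2(a1 + a2))
```

The angles simply add, while the velocities combine through the more awkward rule (7.9); the combined speed stays below c even though 0.5 + 0.7 exceeds 1.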



But the nice addition rule only worked because we were clever in parameterising our 
rotation by an angle. In the case of Lorentz boosts, there is a similarly clever 
parameterisation. Instead of using the velocity v, we define the rapidity $\varphi$ by 

$$\gamma = \cosh\varphi$$

We can see one of the nice things about this definition if we look at 

$$\sinh\varphi = \sqrt{\cosh^2\varphi - 1} = \sqrt{\gamma^2 - 1} = \frac{v\gamma}{c}$$

This is the other component of the Lorentz boost matrix. We can therefore write 

$$\Lambda[\varphi] = \begin{pmatrix} \cosh\varphi & -\sinh\varphi \\ -\sinh\varphi & \cosh\varphi \end{pmatrix} \qquad (7.17)$$

Looking again at the composition of two Lorentz boosts, we see that the rapidities add, 
just like the angles of rotation 

$$\Lambda[\varphi_1]\Lambda[\varphi_2] = \Lambda[\varphi_1 + \varphi_2]$$

The matrix description of the Lorentz boost (7.17) shows most clearly the close 
relationship between rotations and boosts. 

7.3.3 A Rant: Why c = 1 

We started this section by mentioning that the speed of light, $c = 299792458\ \mathrm{m\,s^{-1}}$, is 
exact. The only reason that this fundamental constant is exactly an integer is because 
the meter is defined to be the distance travelled by light in 1/299792458 seconds. 

In our everyday world, the meter is a very useful unit. It is roughly the size of most 
things in my house. But viewed from the perspective of fundamental physics, it is 
rather parochial. If we're going to pick the speed of light to be an integer, we should 
probably pick one that is easier to remember. Like c = 1. We can do this by picking a 
different unit of length, namely 

$$c = 1\ \text{(light second)}\ \mathrm{s^{-1}}$$

where a light second is the distance travelled by light in one second. 

There's a better way of thinking about this: the existence of a universal speed of light 
is Nature's way of telling us that space and time are more similar than our ancestors 
realised. We only labelled space and time with different units because we were unaware 
of the relationship between them. 

We can illustrate this by going back to the rotational analogy. Suppose that you 
decided that all distances in the x-direction should be measured in centimeters, while 
all distances in the y-direction should be measured in inches. You then declared that 
there was a new, fundamental constant of Nature - let's call it $\lambda$ - given by 

$$\lambda = 2.54\ \mathrm{cm\ (inch)^{-1}}$$

Why is this a dumb thing to do? The reason it's dumb is because of the rotational 
symmetry of the laws of physics: different observers have different x and y coordinates 
and can quite happily pick the same unit of measurement for both. But we've learned 
in this section that there is also a symmetry between space and time. Insisting that we 
retain the conversion factor c in the fundamental laws of physics is no more sensible 
than retaining $\lambda$. 

Despite my rant, in these lectures, we will retain c in all equations. (Although we 
will use units which allow us to set $\lambda = 1$). But in future courses, it is common practice 
to work with the convention c = 1. The equations look simpler and the only price you 
pay is that the units of time and space are equivalent. If, at the end of the day, you 
want to get your answer in terms of meters or seconds or whatever then you can always 
put the factors of c back in by dimensional analysis. 

7.4 Relativistic Kinematics 

So far, our discussion has been focussed on what the world looks like to different 
observers. Let's now return to the main theme of these lectures: the motion of particles. 
Remember that our ultimate goal is to construct laws of physics which look the same 
to all inertial observers. For this reason, we will start by defining some of the basic 
elements that go into the laws of physics: velocity, momentum and acceleration. We 
want to define these in such a way that they have nice transformation properties when 
viewed from different inertial frames. 

7.4.1 Proper Time 

We started these lectures in Section 1 by describing the trajectory of a particle in an 
inertial frame in terms of a curve x(t) and velocity u = dx/dt. There's nothing incorrect 
with this description in special relativity but, as we will see, there's a much better way 
to parameterise the trajectory of a particle. 

Let's start by considering a particle at rest at the origin of frame S' with x' = 0. 
The invariant interval between two different points on the worldline of the particle is 

$$\Delta s^2 = c^2 \Delta t'^2$$

We see that the invariant interval between two points on the worldline is proportional 
to the time experienced by the particle. But this must be true in all frames. The time 
experienced by the particle is called the proper time, $\tau$. In all frames, it is given by 

$$\Delta\tau = \frac{\Delta s}{c}$$

where $\Delta s$ is real as long as the particle doesn't travel faster than the speed of light, so 
it sits on a timelike trajectory. (We keep promising to prove that a particle is unable 
to travel faster than light... we are almost there!) 

Proper time provides a way to parameterise the trajectory of a particle in a manner 
that all inertial observers will agree on. Consider the trajectory of a general particle, not 
necessarily travelling in a straight line. Viewed from an inertial frame S, the worldline 
can be parameterised by $\mathbf{x}(\tau)$ and $t(\tau)$. This has several advantages. 

For example, we can use this formulation to determine the time experienced by a 
particle moving along a general trajectory. Along a small segment of its trajectory, a 
particle experiences proper time 

$$d\tau = \sqrt{dt^2 - \frac{d\mathbf{x}^2}{c^2}} = dt\sqrt{1 - \frac{1}{c^2}\left(\frac{d\mathbf{x}}{dt}\right)^2} = dt\sqrt{1 - \frac{u^2}{c^2}}$$

from which we have 

$$\frac{dt}{d\tau} = \gamma \qquad (7.18)$$

Note that $\gamma$ here is a function of the speed, u, of the particle seen by the observer in S. 
From this, the total time T experienced by a particle as it travels along its worldline is 
simply the sum of the proper times associated to each small segment, 

$$T = \int d\tau = \int \frac{dt}{\gamma}$$



(...) 
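
The sum over small segments can be carried out numerically for any speed profile u(t). The following sketch uses a simple midpoint rule; the function name `proper_time`, the units with c = 1, and the two sample speed profiles are assumptions chosen for illustration.

```python
import math

def proper_time(speed, t0, t1, c=1.0, steps=10000):
    """Midpoint-rule estimate of T = integral of dt / gamma(u(t)) from t0 to t1.

    `speed` is a function returning the particle's speed u at coordinate time t.
    """
    h = (t1 - t0) / steps
    total = 0.0
    for i in range(steps):
        u = speed(t0 + (i + 0.5) * h)
        total += math.sqrt(1.0 - (u / c) ** 2) * h   # d tau = dt / gamma
    return total

# Constant speed 0.8c for 10 units of coordinate time: reduces to t / gamma
T_const = proper_time(lambda t: 0.8, 0.0, 10.0)
# A speed that varies in time (assumed profile), never exceeding 0.8c
T_varying = proper_time(lambda t: 0.8 * abs(math.sin(t)), 0.0, 10.0)
print(T_const, T_varying)
```

For constant speed the result matches the time dilation formula. For the varying profile the elapsed proper time lies between that value and the coordinate time, since the particle spends part of the journey moving slowly.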






7.4.2 4-Velocity 

We'll now explain why it's useful to parameterise the trajectory of a particle in terms 
of proper time $\tau$. We can write a general trajectory in spacetime using the 4-vector: 

$$X(\tau) = \begin{pmatrix} ct(\tau) \\ \mathbf{x}(\tau) \end{pmatrix}$$

From this, we can define the 4-velocity, 

$$U = \frac{dX}{d\tau}$$

Using the relationship (7.18) between the proper time of the particle $\tau$ and the 
observer's time t we can write this as 

$$U = \frac{dt}{d\tau} \begin{pmatrix} c \\ \mathbf{u} \end{pmatrix} = \gamma \begin{pmatrix} c \\ \mathbf{u} \end{pmatrix} \qquad (7.20)$$

where $\mathbf{u} = d\mathbf{x}/dt$. This definition of the 4-velocity has a nice property: if an observer 
in frame S measures a particle's 4-velocity as U, then an observer in frame S' with 
$X' = \Lambda X$ will measure the 4-velocity 

$$U' = \Lambda U \qquad (7.21)$$

This transformation holds only because $d\tau$ is invariant, meaning that it is the same for 
all observers. In contrast, if we had tried to define a 4-velocity by, say, V = dX/dt then 
both X and t would change under a Lorentz transformation and we would be left with 
a messy, complicated expression for V in frame S'. Our definition of U differs from V 
by the extra factor of $\gamma$ in (7.20). This is all important! 

Because of the simple transformation law (7.21), we can immediately import some of 
the things that we learned from our previous discussion of Lorentz groups. In particular, 
from the definition of $\Lambda$ given in (7.14), we know that the inner product 

$$U \cdot U = U^T \eta U$$

is invariant. It is the same for all observers: $U \cdot U = U' \cdot U'$. 

Let's look at a simple example. A particle which is stationary in frame S has 
4-velocity 

$$U^\mu = (c, 0, 0, 0)$$

and so $U \cdot U = c^2$. But this must be true in all frames. We can check this explicitly 
from (7.20) (we'll take the middle equation to illustrate the point) which gives us 

$$U \cdot U = \left(\frac{dt}{d\tau}\right)^2 (c^2 - u^2) = \gamma^2 (c^2 - u^2) = c^2$$

This result also helps answer a puzzle. In Newtonian mechanics, if we want to specify 
the velocity, we only have to give three numbers u. But in special relativity, the velocity 
is a 4-vector U. Nonetheless, we still only need specify three variables because U is not 
any 4-vector: it is constrained to obey $U \cdot U = c^2$. 
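
The constraint and its frame independence can be verified for a sample velocity. This is a sketch only: it assumes units with c = 1, a boost along x as in (7.16), and helper names invented for the example.

```python
import math

def four_velocity(u, c=1.0):
    """U = gamma_u (c, u) for a 3-velocity u = (ux, uy, uz)."""
    g = 1.0 / math.sqrt(1.0 - sum(ui * ui for ui in u) / c ** 2)
    return [g * c] + [g * ui for ui in u]

def minkowski_dot(A, B):
    """Inner product with signature (+, -, -, -)."""
    return A[0] * B[0] - A[1] * B[1] - A[2] * B[2] - A[3] * B[3]

def boost_x(U, v, c=1.0):
    """Apply the boost (7.16) along x with velocity v to a 4-vector U."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return [g * (U[0] - v * U[1] / c), g * (U[1] - v * U[0] / c), U[2], U[3]]

U = four_velocity((0.3, 0.4, 0.5))     # sample 3-velocity (assumed), |u| < c
U_prime = boost_x(U, v=0.6)
print(minkowski_dot(U, U), minkowski_dot(U_prime, U_prime))
# The x-velocity seen in S' is recovered as c * U'^1 / U'^0
print(U_prime[1] / U_prime[0])
```

Both inner products equal c² (here 1), and the ratio of components reproduces the one-dimensional transformation (u_x - v)/(1 - u_x v/c²) for the x-component of the velocity.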

Addition of Velocities Revisited 

In Section 7.2.5, we derived the rule for the addition of velocities in one-dimension. 
But what if the velocity of a particle is not aligned with the relative velocity between 
S and S'? The addition of velocities in this case is simple to compute using 4-vectors. 
We start with a particle in frame S travelling with 4-velocity 

$$U = \begin{pmatrix} \gamma_u c \\ \gamma_u u \cos\alpha \\ \gamma_u u \sin\alpha \\ 0 \end{pmatrix}$$

Here we've added the subscript to $\gamma_u = (1 - u^2/c^2)^{-1/2}$ to distinguish it from the 
$\gamma$-factor arising between the two frames. Frame S' moves in the x-direction with speed v 
relative to S. The Lorentz boost is given in (7.16). In frame S', the 4-velocity is then 

$$U' = \gamma_u \begin{pmatrix} (1 - (uv/c^2)\cos\alpha)\,\gamma_v c \\ (u\cos\alpha - v)\,\gamma_v \\ u \sin\alpha \\ 0 \end{pmatrix} = \begin{pmatrix} \gamma_{u'} c \\ \gamma_{u'} u' \cos\alpha' \\ \gamma_{u'} u' \sin\alpha' \\ 0 \end{pmatrix} \qquad (7.22)$$

Dividing the t and x components of this 4-vector, we recover the velocity transformation 
law (7.9) for the speed in the x-direction, namely 

$$u' \cos\alpha' = \frac{u\cos\alpha - v}{1 - uv\cos\alpha/c^2}$$

Meanwhile, dividing the y component by the x component gives us a formula for the 
angle $\alpha'$ that the particle's trajectory makes with the x'-axis, 

$$\tan\alpha' = \frac{u \sin\alpha}{\gamma_v (u\cos\alpha - v)} \qquad (7.23)$$

There is an interesting application of this formula to stellar aberration. The apparent 
position of the stars changes over a 6 month period as the Earth moves around the 
Sun. This is not due to difference in position of the Earth relative to the star: the 
size of the Earth's orbit is puny compared to the distance to the star. Instead, it is 
a special relativistic effect, due to the motion of the Earth around the Sun. We can 
derive this effect by applying the formulae above to light. Choose a coordinate system 
so that the position of the star - and hence the velocity of starlight - lies in the 
(x, y) plane, while the Earth's orbit lies in the (x, z) plane. Then we simply need to 
substitute u = c in the above equation, while v is the component of the Earth's velocity 
in the x-direction. 

The algebra can get somewhat messy, so it's worth walking you through the steps. 
It's best to return to (7.22), now with u = u' = c as befits a light ray. (It's advisable 
to first substitute the factor of $\gamma_{u'}$ in the x and y components for the expression for $\gamma_{u'}$ 
given in the temporal component). This then gives us 

$$\sin\alpha' = \frac{c \sin\alpha}{\gamma_v (c - v\cos\alpha)} \quad \text{and} \quad \cos\alpha' = \frac{c\cos\alpha - v}{c - v\cos\alpha}$$

We now use the trig identity $\tan(\alpha'/2) = \sin\alpha'/(1 + \cos\alpha')$. Substituting the expressions 
above, a little algebra then results in the stellar aberration formula, 

$$\tan\frac{\alpha'}{2} = \sqrt{\frac{c + v}{c - v}}\, \tan\frac{\alpha}{2}$$

Here $\alpha$ is the angle that the star would appear at if the Earth was at rest. Or, said 
another way, $\alpha$ is the angle at which the star appears when the motion of the Earth 
is perpendicular to the direction of the star. Meanwhile, $\alpha'$ is the angle at which the 
star appears when the Earth is travelling with speed v in the x-direction. As the Earth 
moves around the Sun, the component of its velocity v in the x-direction changes, 
meaning that the angle $\alpha'$ at which the star appears also changes. 
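
As a consistency check on the "little algebra", the half-angle formula can be compared numerically with the separate expressions for sin α' and cos α'. In this sketch the function names and the parameter beta = v/c are assumptions; the value beta ≈ 1e-4 used at the end corresponds to the Earth's orbital speed of roughly 30 km/s.

```python
import math

def aberrated_angle(alpha, beta):
    """alpha' from the sin and cos expressions for a light ray, with beta = v/c."""
    g = 1.0 / math.sqrt(1.0 - beta ** 2)
    sin_a = math.sin(alpha) / (g * (1.0 - beta * math.cos(alpha)))
    cos_a = (math.cos(alpha) - beta) / (1.0 - beta * math.cos(alpha))
    return math.atan2(sin_a, cos_a)

def aberrated_angle_half(alpha, beta):
    """alpha' from tan(alpha'/2) = sqrt((1 + beta)/(1 - beta)) tan(alpha/2)."""
    factor = math.sqrt((1.0 + beta) / (1.0 - beta))
    return 2.0 * math.atan(factor * math.tan(alpha / 2.0))

for alpha in (0.3, 1.0, math.pi / 2, 2.5):
    print(alpha, aberrated_angle(alpha, 0.5), aberrated_angle_half(alpha, 0.5))

# Size of the effect for the Earth: beta is about 1e-4
shift = aberrated_angle_half(math.pi / 2, 1e-4) - math.pi / 2
print("shift in arcseconds:", math.degrees(shift) * 3600)
```

The two routes to α' agree for every angle tried. For small beta the shift at α = π/2 is approximately beta radians.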

7.4.3 4-Momentum 

The 4-momentum is defined by 

$$P = mU = \begin{pmatrix} m\gamma c \\ m\gamma \mathbf{u} \end{pmatrix} \qquad (7.24)$$

where m is the mass of the particle, usually referred to as the rest mass. Importantly, 
it will turn out that P is the quantity that is conserved in the relativistic context. The 
spatial components give us the relativistic generalisation of the 3-momentum, 

$$\mathbf{p} = m\gamma\mathbf{u} \qquad (7.25)$$

Notice that as the particle approaches the speed of light, $u \to c$, the momentum diverges 
$p \to \infty$. Since momentum is conserved in all processes, this is really telling us that 
massive particles cannot break the speed of light barrier. (Here the word "massive" 
doesn't mean "really really big": it just means "not massless", or $m \neq 0$). This is 
sometimes interpreted by viewing the quantity $m\gamma$ as a velocity-dependent relativistic 
mass. In these terms, the relativistic mass of the particle diverges $m\gamma \to \infty$ as the 
particle approaches the speed of light. The words may be different, but the maths (and 
underlying physics) is the same: particles are bound by Nature's speed limit. Nothing 
can travel faster than the speed of light. 

What is the interpretation of the time-component of the momentum 4-vector, $P^0$? 
We can get a hint of this by Taylor expanding the $\gamma$ factor, 

$$P^0 = \frac{mc}{\sqrt{1 - u^2/c^2}} = \frac{1}{c}\left( mc^2 + \frac{1}{2}mu^2 + \ldots \right) \qquad (7.26)$$

The first term is just a constant. But the second term is something familiar: it is the 
non-relativistic kinetic energy of the particle. This, coupled with the fact that all four 
components of P are conserved, strongly suggests that the right interpretation of $P^0$ is 
the energy of the particle (divided by c), so 

$$P = \begin{pmatrix} E/c \\ \mathbf{p} \end{pmatrix} \qquad (7.27)$$

To show that $P^0$ is indeed related to the energy in this way requires a few more 
techniques than we will develop in this course. The cleanest way is to use Noether's 
theorem - which we mentioned briefly in Section 5.1.4 - for relativistic systems and see 
that $P^0$ is the quantity that follows from time translational invariance*. 

The expansion of (7.26) shows that both the mass and the kinetic energy contribute 
to the energy of a particle. These combine to give 

$$E = m\gamma c^2 \qquad (7.28)$$

* You can read more about this for particle mechanics in the Classical Dynamics lecture notes 
and, for relativistic field theories, in the Quantum Field Theory lecture notes. Both are available at 
http://www.damtp.cam.ac.uk/user/tong/teaching.html 

Notice that as the particle approaches the speed of light, its energy diverges. Yet again, 
we see a barrier to breaking the speed limit: as we approach the speed of light, the 
energy required to make a particle go just a little faster gets bigger and bigger. 



For a stationary particle, all its energy is contained in its rest mass, giving us the 
famous slogan 

$$E = mc^2$$

There's a nice way to rearrange (7.28), to replace the u in the $\gamma$ factor with p defined 
in (7.25). But the algebra is laborious. Instead there's a cute trick that gives the result 
much more quickly: we look at the inner product $P \cdot P$. In the rest frame of the particle, 
P = (mc, 0, 0, 0) and we have 

$$P \cdot P = m^2 c^2 \qquad (7.29)$$

But the inner product is an invariant, holding in any frame. From (7.27), we have 

$$P \cdot P = \frac{E^2}{c^2} - \mathbf{p}^2$$

Equating these two expressions gives 

$$E^2 = \mathbf{p}^2 c^2 + m^2 c^4 \qquad (7.30)$$

This is the generalisation of $E = mc^2$ to include the kinetic energy. This equation can 
also be derived the hard way by playing around with (7.28) and (7.25). 
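
For anyone who would rather not do it the hard way, a numerical check of (7.30) and of the low-speed expansion (7.26) is quick. The sketch below assumes one-dimensional motion and units with c = 1 and m = 1 for the sample values; the function name `energy_momentum` is made up for this example.

```python
import math

def energy_momentum(m, u, c=1.0):
    """Return (E, p) from E = m gamma c^2 (7.28) and p = m gamma u (7.25)."""
    g = 1.0 / math.sqrt(1.0 - (u / c) ** 2)
    return m * g * c ** 2, m * g * u

m, c = 1.0, 1.0
for u in (0.1, 0.6, 0.99):
    E, p = energy_momentum(m, u, c)
    # Both sides of E^2 = p^2 c^2 + m^2 c^4
    print(u, E ** 2, p ** 2 * c ** 2 + m ** 2 * c ** 4)

# At low speed the energy above mc^2 approaches the Newtonian kinetic energy
u_slow = 1e-3
E_slow, _ = energy_momentum(m, u_slow, c)
print(E_slow - m * c ** 2, 0.5 * m * u_slow ** 2)
```

The relation holds at every speed, while the agreement with the Newtonian kinetic energy is only approximate and degrades as u grows.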

The identification $P^0 = E/c$ has dramatic consequences. In Newtonian mechanics, 
we boasted about the conservation of energy, but implicit in everything we did was the 
more elementary fact that mass is conserved. Even in the variable mass problems of 
Section 5.3, the mass never disappeared: it just left our rocket ship. However, relativity 
teaches us that the conservation of mass is subsumed into the conservation of energy. 
There is nothing that guarantees that they are individually conserved. Just as potential 
energy can be converted into kinetic energy, so too can mass be converted into kinetic 
energy. In Japan, in 1945, this fact was vividly demonstrated. 

7.4.4 Massless Particles 

Until now, we built our discussion of particle trajectories on proper time. But, looking 
back at Section 7.4.1, proper time is only defined for time-like trajectories. This is 
fine for massive particles. But what about for massless particles? We can sidestep the 
need for proper time by looking at the invariant of the 4-momentum (7.29) which, for 
particles with m = 0, tells us that the 4-momentum must be null, 

$$P \cdot P = 0$$

This means that the 4-momentum of a massless particle necessarily lies along a light 
ray. 

This fact also allows us to clarify one of our original postulates of special relativity: 
that the speed of light is the same for all inertial frames. You may wonder why the 
propagation of light, an electromagnetic phenomenon, is singled out for special treat- 
ment. The answer is: because the photon - the particle of light - is massless. In fact, 
a better way of stating the postulate is to say that there is an upper speed limit in the 
Universe, which is the same for all inertial observers. Any massless particle must travel 
at this speed limit. All massive particles must go slower. 

We know of only two types of massless particles in the Universe: the photon and the 
graviton. Both of these owe their particle-like nature to quantum mechanics (actually, 
this is true of all particles) and have a classical analog as light waves and gravity 
waves respectively. You've all seen light waves (literally!) and individual photons have 
been routinely measured in experiments for more than a century. In contrast there is 
good, but indirect, evidence for gravity waves (this is expected to change soon, with 
experiments up and running which are designed to directly detect gravity waves). There 
appears to be no hope at all of detecting an individual graviton, at least within our 
lifetimes. 

Until the late 1990s, it was thought that neutrinos were also massless. It is now 
known that they have a small, but finite mass. (Actually, there's a caveat here: there 
are three different types of neutrino: an electron neutrino, a muon neutrino and a tau 
neutrino. The differences between their masses are known to be of order 0.01 to 0.1 eV
and there are constraints which limit the sum of their masses to be no greater than
0.3 eV or so. But the absolute scale of their masses has not yet been determined. In
principle, one of the three neutrinos may be massless.)

From (7.30), the energy and momentum of a massless particle are related by
$E^2 = \mathbf{p}^2 c^2$. The four momentum takes the form

$$P = \frac{E}{c}\,(1, \hat{\mathbf{p}})$$

where $\hat{\mathbf{p}}$ is a unit vector in the direction of the particle's motion.

To get an expression for the energy, we need a result from quantum mechanics which
relates the energy to the wavelength $\lambda$ of the photon or, equivalently, to the angular
frequency $\omega = 2\pi c/\lambda$,

$$E = \hbar\omega = \frac{2\pi\hbar c}{\lambda}$$

There's something rather nice about how this equation ties in with special relativity. 
Suppose that in your frame, the photon has energy E. But a different observer moves 
towards the light with velocity v. By the Lorentz transformation, he will measure the
4-momentum of the photon to be $P' = \Lambda P$ and, correspondingly, will see a bigger energy
E' > E. From the above equation, this implies that he will see a smaller wavelength. 
But this is nothing other than Lorentz contraction. 

The phenomenon of different observers observing different wavelengths of light is 
called the Doppler effect. You will derive this in the problem sheet. 
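
As a preview of that calculation, here is a minimal sketch that boosts a photon 4-momentum of the form $(E/c)(1, \hat{\mathbf{p}})$ into the frame of an observer moving head-on towards the light. It assumes the standard boost along the x-axis; the function names are hypothetical and the head-on geometry is chosen only to keep things simple.

```python
import math


def boost_x(P, beta):
    """Components of the 4-vector P = (P0, P1, P2, P3) in a frame that moves
    with velocity beta*c along +x relative to the original frame."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    P0, P1, P2, P3 = P
    return (g * (P0 - beta * P1), g * (P1 - beta * P0), P2, P3)


def minkowski_square(P):
    """Inner product P.P with signature (+, -, -, -)."""
    return P[0] ** 2 - P[1] ** 2 - P[2] ** 2 - P[3] ** 2


def head_on_energy_ratio(beta):
    """E'/E for an observer moving at speed beta*c towards a photon that
    travels along +x (so the observer moves along -x)."""
    photon = (1.0, 1.0, 0.0, 0.0)  # (E/c)(1, p_hat) in units where E/c = 1
    photon_prime = boost_x(photon, -beta)
    return photon_prime[0] / photon[0]


for beta in (0.1, 0.6, 0.9):
    ratio = head_on_energy_ratio(beta)
    closed_form = math.sqrt((1 + beta) / (1 - beta))
    print(f"beta = {beta}: E'/E = {ratio:.6f}, sqrt((1+beta)/(1-beta)) = {closed_form:.6f}")
```

Because the energy is inversely proportional to the wavelength, the observer's wavelength is smaller by the same factor. The boosted 4-momentum is still null, as it must be.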

Tachyons and Why They're Nonsense 

It is sometimes stated that a particle which has imaginary mass, so that $m^2 < 0$,
will have $P \cdot P < 0$ and so travel consistently at speeds u > c. Such particles are
called tachyons. They too would be unable to cross Nature's barrier at u = c and are
consigned to always travel on spacelike trajectories.

Although consistent within the framework of classical relativistic particle mechanics,
the possibility of tachyons does not survive the leap to more sophisticated theories
of physics. All our current best theories of physics are written in the framework of
quantum field theory. Here particles emerge as ripples of fields, tied into small lumps
of energy by quantum mechanics. But in quantum field theory, it is not unusual to
have fields with imaginary mass, $m^2 < 0$. The resulting particles do not travel faster
than the speed of light. Instead, imaginary mass signals an instability of the vacuum.

7.4.5 Newton's Laws of Motion 

Finally, we are in a position to write down Newton's law of motion in a manner that
is consistent with special relativity: it is

$$\frac{dP^\mu}{d\tau} = F^\mu \qquad (7.31)$$

where $F^\mu$ are the components of a 4-vector force. It is not difficult to anticipate that
the spatial components of F should be related to the 3-vector force $\mathbf{f}$. (This is the same
thing that we've been calling F up until now, but we'll lower its standing to a small f
to save confusion with the 4-vector). In fact, we need an extra factor of $\gamma$, so

$$F = (F^0, \gamma\,\mathbf{f})$$

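With this factor of $\gamma$ in place, the spatial components of Newton's equation (7.31) agree
with the form that we're used to in the reference frame S,

$$\frac{d\mathbf{p}}{dt} = \frac{d\tau}{dt}\frac{d\mathbf{p}}{d\tau} = \frac{1}{\gamma}\frac{d\mathbf{p}}{d\tau} = \mathbf{f}$$

Similarly, a quick calculation shows that the temporal component $F^0$ is related to the
power: the rate of change of energy with time

$$F^0 = \frac{dP^0}{d\tau} = \frac{\gamma}{c}\frac{dE}{dt}$$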

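With these definitions, we can derive a familiar equation, relating the change in energy
to the work done. Consider a particle with constant rest mass m, so that $P \cdot P = m^2c^2$
is unchanging. Using $P^0 = m\gamma c$ and $\mathbf{p} = m\gamma\mathbf{u}$, we have

$$\frac{d}{d\tau}(P \cdot P) = 2P^0\frac{dP^0}{d\tau} - 2\mathbf{p}\cdot\frac{d\mathbf{p}}{d\tau} = 2\gamma^2 m\left(\frac{dE}{dt} - \mathbf{u}\cdot\mathbf{f}\right) = 0$$

so that $dE/dt = \mathbf{f}\cdot\mathbf{u}$.
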
All of this is just to show how the familiar laws of Newtonian physics sit within special 
relativity. 

Electromagnetism Revisited 

Ironically, equation (7.31) is rarely used in relativistic physics! The reason is that by
the time we are in the relativistic realm, most of the forces that we've come across are
no longer valid. The one exception is the electromagnetic force law for a particle of
charge q that we met in Section 2.4. This does have a relativistic formulation, with the
equation of motion given by

$$\frac{dP^\mu}{d\tau} = \frac{q}{c}\,G^\mu{}_\nu U^\nu$$

where $U^\nu$ is the 4-velocity of the particle and $G^\mu{}_\nu$ is the electromagnetic tensor, a 4 x 4
matrix which contains the electric and magnetic fields,

$$G^\mu{}_\nu = \begin{pmatrix} 0 & E_1 & E_2 & E_3 \\ E_1 & 0 & cB_3 & -cB_2 \\ E_2 & -cB_3 & 0 & cB_1 \\ E_3 & cB_2 & -cB_1 & 0 \end{pmatrix}$$

(This tensor often goes by the name $F^\mu{}_\nu$, but we've chosen to call it G to save confusion
with the force 4-vector). The spatial components of the four-vector equation give rise
to the familiar Lorentz force law (2.18). The temporal component gives the rate of
work done, $dE/dt = q\,\mathbf{E}\cdot\mathbf{u}$.

7.4.6 Acceleration 

We can construct a four-vector for acceleration simply by

$$A = \frac{dU}{d\tau}$$

Note that because $U \cdot U = c^2$, we must have that A is always orthogonal to U in the
Minkowski sense: $A \cdot U = 0$.

Suppose that the velocity of a particle in frame S is u. Then, in this frame, the
Newtonian notion of 3-acceleration is $\mathbf{a} = d\mathbf{u}/dt$. Recalling our expression relating
time and proper time, $dt/d\tau = \gamma$, we see that the four acceleration actually depends
on both u and a; it is

$$A = \gamma\,(\dot\gamma c,\ \dot\gamma\mathbf{u} + \gamma\mathbf{a})$$

with $\dot\gamma = d\gamma/dt$.

Let's now suppose that we sit in an inertial frame S' in which, at a fixed moment of 
time t, the particle is instantaneously at rest. Obviously, if the particle is accelerating, 
this will not coincide with the particle's rest frame an instant later, but momentarily 
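this will do fine. Since $\mathbf{u}' = 0$ in this frame, the 4-acceleration is

$$A' = (0, \mathbf{a}')$$

with $\mathbf{a}' = d\mathbf{u}'/dt'$. (Note that you need to do a small calculation here to check that
$\dot\gamma(u = 0) = 0$). But, since we have constructed our acceleration as a 4-vector, A and
A' must be related by a Lorentz transformation. To make matters easy for ourselves,
let's take both u and a to lie in the x-direction so that we can consistently ignore the
y and z-directions. Then the Lorentz transformation tells us

$$\begin{pmatrix} 0 \\ a' \end{pmatrix} = \begin{pmatrix} \gamma & -u\gamma/c \\ -u\gamma/c & \gamma \end{pmatrix} \begin{pmatrix} \gamma\dot\gamma c \\ \gamma^2 a + \gamma\dot\gamma u \end{pmatrix}$$

The top component simply reproduces $\dot\gamma = \gamma^3 u a/c^2$, while the bottom component
gives $a' = \gamma^3 a$. This determines the relationship between the accelerations
a and a' seen in the two frames,

$$a \equiv \dot{u} = \left(1 - \frac{u^2}{c^2}\right)^{3/2} a'$$
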

Suppose now that the particle undergoes constant acceleration. As with everything 
in special relativity, we need to be more careful about what we mean by this. The 
natural interpretation is that the acceleration in the frame of the particle is constant. 
Mathematically, this means that a' is constant. In contrast, viewed from frame S, the
acceleration is not constant. Indeed, for constant a', we can integrate our equation
above to get u, the velocity seen in frame S, as a function of time. If we assume that
u = 0 when t = 0, we have

$$u = \frac{a'ct}{\sqrt{c^2 + a'^2t^2}} \quad\Rightarrow\quad \gamma(t) = \sqrt{1 + \frac{a'^2t^2}{c^2}} \qquad (7.32)$$

Since $u = \dot{x}$, integrating the first of these equations once more gives us the position in
the frame S as a function of time,

$$x = \frac{c}{a'}\left(\sqrt{c^2 + a'^2t^2} - c\right) \qquad (7.33)$$

where we've picked an integration constant so that x = 0 at time t = 0. We see that
the particle moves on the hyperbola shown in the figure. Viewed from S, the particle
approaches, but never reaches, the speed of light.
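
The closed forms for u(t) and x(t) can be checked against a direct numerical integration of $\dot{u} = (1 - u^2/c^2)^{3/2}\,a'$. The sketch below is only illustrative: it works in units with c = 1 and a' = 1, and the Runge-Kutta stepper and the function names are choices made here, not part of the notes.

```python
import math


def u_exact(t, a_prime, c=1.0):
    """Velocity in frame S for constant proper acceleration a_prime."""
    return a_prime * c * t / math.sqrt(c**2 + (a_prime * t) ** 2)


def x_exact(t, a_prime, c=1.0):
    """Position in frame S, with x = 0 at t = 0."""
    return (c / a_prime) * (math.sqrt(c**2 + (a_prime * t) ** 2) - c)


def integrate(a_prime, t_end, steps=10_000, c=1.0):
    """Integrate du/dt = (1 - u^2/c^2)^(3/2) a_prime with RK4, and dx/dt = u
    with the trapezoid rule, starting from u = x = 0 at t = 0."""

    def dudt(u):
        return (1.0 - u**2 / c**2) ** 1.5 * a_prime

    dt = t_end / steps
    u, x = 0.0, 0.0
    for _ in range(steps):
        k1 = dudt(u)
        k2 = dudt(u + 0.5 * dt * k1)
        k3 = dudt(u + 0.5 * dt * k2)
        k4 = dudt(u + dt * k3)
        u_next = u + dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
        x += 0.5 * dt * (u + u_next)
        u = u_next
    return u, x


u_num, x_num = integrate(a_prime=1.0, t_end=5.0)
print(f"u: numerical {u_num:.8f}, closed form {u_exact(5.0, 1.0):.8f}")
print(f"x: numerical {x_num:.8f}, closed form {x_exact(5.0, 1.0):.8f}")
```

However long we integrate, the numerical velocity stays below c = 1, in line with the statement that the particle approaches but never reaches the speed of light.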







Notice that a particle at point P in the diagram 
can only receive information from within its own past 
lightcone, denoted by the red dotted lines in the figure. 
However, if it continues along its accelerated trajectory, 
it can never receive any information from the whole part
of spacetime to the left of the null line x = ct. This 
part of the Universe will forever remain a mystery to an 
accelerated observer. The null cone, defined by, x = ct, 
which forms the boundary of the mysterious region is 
called the Rindler event horizon. It has many things 
in common with the event horizon of a black hole and, 
indeed, the Rindler horizon is often used as a toy model to understand some of the 
stranger aspects of black hole physics. Of course, if an accelerated observer really 
wants to see what's behind the horizon, it's easy: he just stops accelerating. If an 
observer in the background of a black hole wants to see what's behind the horizon, he 
must be somewhat braver. 



Figure 56: 

We can look at what the accelerated observer feels. His time is simply the proper
time of the particle. To compute this, the form of $\gamma(t)$ given in (7.32) is particularly
useful. From (7.19), if time t elapses in the stationary frame S, then the particle feels

$$\tau = \int_0^t \frac{c\,d\tilde{t}}{\sqrt{c^2 + a'^2\tilde{t}^2}} = \frac{c}{a'}\sinh^{-1}\left(\frac{a't}{c}\right)$$

This analysis gives us a more quantitative way to view
the twin paradox. Suppose that Luke undertakes his
trip to Tatooine on a trajectory of constant acceleration.
He leaves Leia at the time t < 0 where their worldlines
intersect, arrives at Tatooine at t = 0 and $x = c^2/a'$,
and returns back to Leia as shown. Leia experiences
time t; Luke time $\tau < t$.

Figure 57:

Finally, we can look at how far the accelerated observer thinks he has travelled. Of
course, this observer is not in an inertial frame, but at any time t we can consider the
inertial frame that is momentarily at rest with respect to the accelerated particle. This
allows us to simply use the Lorentz contraction formula. Using our results (7.32) and
(7.33), we have

$$x' = \frac{x}{\gamma} = \frac{c^2}{a'}\left(1 - \frac{c}{\sqrt{c^2 + a'^2t^2}}\right)$$

Curiously, $x' \to c^2/a'$ is finite as $t \to \infty$ or, equivalently, as $\tau \to \infty$. Despite all that
effort, an accelerated observer doesn't think he has got very far! This again, is related
to the presence of the horizon.

7.4.7 Indices Up, Indices Down 

The minus signs in the Minkowski metric $\eta$ mean that it's useful to introduce a slight
twist to the usual summation convention of repeated indices. For all the 4-vectors that
we introduced above, we were careful to always place the spacetime index $\mu = 0, 1, 2, 3$ as
a superscript (i.e. up) rather than a subscript.

$$X^\mu = (ct, \mathbf{x})$$

This is because the same object with an index down, $X_\mu$, will mean something subtly
different!

$$X_\mu = (ct, -\mathbf{x})$$

With this convention, the Minkowski inner product can be written using the usual
convention of summing over repeated indices as

$$X^\mu X_\mu = c^2t^2 - \mathbf{x}\cdot\mathbf{x}$$

In contrast, writing $X^\mu X^\mu = c^2t^2 + \mathbf{x}^2$ is a dumb thing to write in the context of special
relativity since it looks very different to observers in different inertial frames. In fact,
we will shortly declare it illegal to write things like $X^\mu X^\mu$.

There is a natural way to think of $X_\mu$ in terms of $X^\mu$. If we write the Minkowski
metric as the diagonal matrix $\eta_{\mu\nu} = \mathrm{diag}(+1, -1, -1, -1)$ then we can raise and lower
indices using $\eta_{\mu\nu}$ and the summation convention, so

$$X_\mu = \eta_{\mu\nu} X^\nu$$

Moreover, we will insist that all objects with indices up and down are similarly related
by contracting with $\eta$. For example, we could write the electromagnetic tensor as

$$G_{\mu\nu} = \eta_{\mu\rho}\,G^\rho{}_\nu = \begin{pmatrix} 0 & E_1 & E_2 & E_3 \\ -E_1 & 0 & -cB_3 & cB_2 \\ -E_2 & cB_3 & 0 & -cB_1 \\ -E_3 & -cB_2 & cB_1 & 0 \end{pmatrix}$$

The object $G_{\mu\nu}$ is actually somewhat more natural than $G^\mu{}_\nu$ since the former is
antisymmetric.

To raise indices back up, we need the inverse of $\eta_{\mu\nu}$ which, fortunately, is the same
matrix: $\eta^{\mu\nu} = \mathrm{diag}(+1, -1, -1, -1)$. We have

$$\eta^{\mu\rho}\eta_{\rho\nu} = \delta^\mu{}_\nu$$

This trick of distinguishing between indices up and indices down provides a simple 
formalism to ensure that all objects have nice transformation properties under the 
Lorentz group. We insist that, just as in the usual summation convention, repeated
indices only ever appear in pairs. But now we further insist that pairs always appear
with one index up and the other down. The result will be an object which is invariant
under Lorentz transformations.
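
To see the rule at work, here is a small sketch that lowers an index with the metric and compares the contraction $X^\mu X_\mu$ with the "illegal" sum $X^\mu X^\mu$ before and after a boost. The sample vector, the boost speed and the function names are illustrative assumptions.

```python
import math

ETA = (1.0, -1.0, -1.0, -1.0)  # diagonal entries of the Minkowski metric


def lower(X_up):
    """X_mu = eta_{mu nu} X^nu for a diagonal metric."""
    return tuple(eta * x for eta, x in zip(ETA, X_up))


def contract(X_up, Y_down):
    """Sum over one repeated index, one up and one down."""
    return sum(x * y for x, y in zip(X_up, Y_down))


def boost_x(X_up, beta):
    """Standard Lorentz boost along x with speed beta*c."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    X0, X1, X2, X3 = X_up
    return (g * (X0 - beta * X1), g * (X1 - beta * X0), X2, X3)


X = (5.0, 1.0, 2.0, 3.0)  # (ct, x, y, z), arbitrary sample values
X_boosted = boost_x(X, 0.6)

# One index up, one down: the same number in both frames.
print(contract(X, lower(X)), contract(X_boosted, lower(X_boosted)))

# Both indices up: the number changes from frame to frame.
print(sum(x * x for x in X), sum(x * x for x in X_boosted))
```

The first line prints the same value twice (up to rounding), while the second does not, which is the reason for declaring the second kind of expression illegal.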

In future courses, you will learn that there is somewhat deeper mathematics lying
behind distinguishing $X^\mu$ and $X_\mu$: formally, these objects live in different spaces
(sometimes called dual spaces). Objects such as $X^\mu$ are said to be contravariant vectors,
while $X_\mu$ is said to be a covariant vector.

7.5 Particle Physics

"Oh, that stuff. We never bother with that in our work" 



Ernest Rutherford, the first particle physicist, discussing relativity 

Our goal in this section is to describe various relativistic phenomena that arise in
particle physics. All these processes occur in the absence of external forces, so F = 0
and we will rely only on conservation of 4-momentum, meaning

$$\frac{dP}{d\tau} = 0$$

Of course, conservation of 4-momentum includes both conservation of 3-momentum 
and conservation of energy. 

The calculations that follow are similar in spirit to the collision calculations of Section 
5.2. Before we proceed, there are a couple of hints that may help when solving these 
problems. Firstly, we need to choose a frame of reference in which to calculate: the 
smart frame to choose is nearly always the centre of mass of the system. (Which should 
more correctly be called the centre of momentum frame, for it is the one with vanishing 
spatial 3-momentum). Secondly, you will often be presented with a situation where 
there is one particle with momentum P about which you know nothing. A good way 
to eliminate this is often to rearrange your equation so it takes the form P = . . . and
then square it to get the left-hand side to be $P \cdot P = m^2c^2$. Let's now see how this
works in a few examples. 

7.5.1 Particle Decay 

Consider a single particle with rest mass $m_1$ which decays into two particles with rest
masses $m_2$ and $m_3$. Conservation of 4-momentum tells us

$$P_1 = P_2 + P_3$$

or, equivalently,

$$E_1 = E_2 + E_3 \quad\text{and}\quad \mathbf{p}_1 = \mathbf{p}_2 + \mathbf{p}_3$$

In the rest frame of the decaying particle, we can write (using (7.30)),

$$E_1 = m_1c^2 = \sqrt{\mathbf{p}_2^2c^2 + m_2^2c^4} + \sqrt{\mathbf{p}_3^2c^2 + m_3^2c^4} \geq m_2c^2 + m_3c^2$$

which tells us the unsurprising result that a particle can only decay if its mass is greater
than that of its decay products. In the problem sheet, you will be asked to compute
the velocities $v_2$ and $v_3$ of the decay products in the centre of mass frame and show
that they are given by

$$\gamma_2 = \frac{m_1^2 + m_2^2 - m_3^2}{2m_1m_2} \quad\text{and}\quad \gamma_3 = \frac{m_1^2 + m_3^2 - m_2^2}{2m_1m_3}$$

Here we will instead look at some slightly different problems.
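
Before moving on, the quoted $\gamma$ factors can at least be sanity-checked against conservation of energy and momentum. The sketch below works in units with c = 1 and uses made-up masses; the function name is hypothetical, and this is a consistency check rather than the derivation asked for in the problem sheet.

```python
import math


def decay_gammas(m1, m2, m3):
    """Lorentz factors of the two (massive) decay products in the rest frame
    of the decaying particle, using the expressions quoted above."""
    if m1 < m2 + m3:
        raise ValueError("decay is forbidden: m1 < m2 + m3")
    g2 = (m1**2 + m2**2 - m3**2) / (2.0 * m1 * m2)
    g3 = (m1**2 + m3**2 - m2**2) / (2.0 * m1 * m3)
    return g2, g3


m1, m2, m3 = 10.0, 3.0, 4.0  # illustrative masses in units with c = 1
g2, g3 = decay_gammas(m1, m2, m3)

energy_out = g2 * m2 + g3 * m3          # E = gamma m c^2 for each product
p2 = m2 * math.sqrt(g2**2 - 1.0)        # |p| = m c sqrt(gamma^2 - 1)
p3 = m3 * math.sqrt(g3**2 - 1.0)

print(f"energy in = {m1}, energy out = {energy_out}")
print(f"|p2| = {p2:.6f}, |p3| = {p3:.6f}")
```

The outgoing energies add up to the rest energy of the parent, and the two momenta have equal magnitude, as they must if they are to cancel in the rest frame.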

An Example: Higgs Decay 

The LHC has taught us that the Higgs boson has mass $m_hc^2 \approx 125$ GeV. One of the ways
it can decay is into two photons. In particle physics, photons are always denoted by $\gamma$. Do
not confuse them with the Lorentz contraction factor! The "equations" in which the
photon $\gamma$'s appear are more like chemical reactions than true equations. The decay of
the Higgs into two photons is written as

$$h \to \gamma\gamma$$

Similar decays occur for other particles, most notably the neutral pion, a meson (meaning
that it is made of a quark and anti-quark) with mass $m_\pi c^2 \approx 140$ MeV. This too
decays as $\pi^0 \to \gamma\gamma$.

To be concrete (and more relevant!) we'll focus on the Higgs. Conservation of
4-momentum tells us (in, hopefully, obvious notation) that

$$P_h = P_\gamma + P'_\gamma$$

If we sit in the rest frame of the Higgs, so $P_h = (m_hc, 0)$, the photons must have equal
and opposite 3-momentum, and therefore equal energy $E_\gamma = \frac{1}{2}m_hc^2$. The photons must
be emitted back-to-back but, because the problem is rotationally symmetric, can be
emitted at any angle.

What if we're not sitting in the rest frame of the Higgs? Suppose that the Higgs
has energy $E_h$ and the energy of one of the photons is measured to be $E_\gamma$. What is the
angle $\theta$ that this photon makes with the path of the Higgs?

We'll use the strategy that we described above. We have no information about the
second photon, with 4-momentum $P'_\gamma$. So we rearrange the conservation of momentum
to read $P'_\gamma = P_h - P_\gamma$. Upon squaring this, we have $P'_\gamma \cdot P'_\gamma = 0$, so

$$0 = (P_h - P_\gamma)\cdot(P_h - P_\gamma) = P_h\cdot P_h + P_\gamma\cdot P_\gamma - 2P_h\cdot P_\gamma$$

$$\phantom{0} = m_h^2c^2 - \frac{2E_hE_\gamma}{c^2} + 2\,\mathbf{p}_h\cdot\mathbf{p}_\gamma$$

$$\phantom{0} = m_h^2c^2 - \frac{2E_hE_\gamma}{c^2} + \frac{2E_\gamma}{c}\cos\theta\,\sqrt{E_h^2/c^2 - m_h^2c^2}$$

where, in the last equation, we have used $E^2 = \mathbf{p}^2c^2 + m^2c^4$ (which is just E = pc for
the photon). This can now be rearranged to give the answer for $\theta$.
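
Multiplying the last line through by $c^2$ and solving for the angle gives $\cos\theta = (2E_hE_\gamma - m_h^2c^4)/(2E_\gamma\sqrt{E_h^2 - m_h^2c^4})$. The sketch below evaluates this with all quantities expressed as energies; the function name and the sample numbers are illustrative assumptions, and the Higgs is taken to be moving ($E_h > m_hc^2$) so that the angle is defined.

```python
import math


def photon_angle(E_h, E_gamma, m_h):
    """Angle in radians between the photon and the Higgs direction.

    All three arguments are energies in the same units, with m_h standing
    for the rest energy m_h c^2. Requires E_h > m_h."""
    p_h = math.sqrt(E_h**2 - m_h**2)  # |p_h| c
    cos_theta = (2.0 * E_h * E_gamma - m_h**2) / (2.0 * E_gamma * p_h)
    cos_theta = max(-1.0, min(1.0, cos_theta))  # guard against rounding
    return math.acos(cos_theta)


# Illustrative numbers in GeV: a Higgs with E_h = 200 and a photon with E = 100.
theta = photon_angle(E_h=200.0, E_gamma=100.0, m_h=125.0)
print(f"theta = {math.degrees(theta):.2f} degrees")
```

These sample numbers correspond to a photon emitted at right angles in the Higgs rest frame: there it has energy 62.5 GeV, and boosting with $\gamma = 200/125 = 1.6$ gives 100 GeV and $\cos\theta$ equal to the Higgs speed in units of c.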

Figure 58: Collisions in the centre of mass frame on the left and the lab frame on the right 
7.5.2 Particle Collisions 

Let's now look at the physics of relativistic collisions. We'll collide two particles
together, both of mass m. They will interact in some manner, preserving both energy
and 3-momentum, and scatter at an angle $\theta$.

$$P_1 + P_2 \to P_3 + P_4$$

As we mentioned previously, it's easiest to see what happens in the centre of mass frame.
Without loss of generality, we'll take the initial momenta to be in the x-direction. After
the collision, the particles must have equal and opposite momenta, which means they
must also have equal energy. This, in turn, ensures that in the centre of mass frame,
the speed v after the collision is the same as before. We can choose our axes so that
the initial and final momenta are given by

$$P_1 = (mc\gamma_v, mv\gamma_v, 0, 0), \qquad P_2 = (mc\gamma_v, -mv\gamma_v, 0, 0)$$

$$P_3 = (mc\gamma_v, mv\gamma_v\cos\theta, mv\gamma_v\sin\theta, 0), \qquad P_4 = (mc\gamma_v, -mv\gamma_v\cos\theta, -mv\gamma_v\sin\theta, 0)$$

where we've put the subscript on $\gamma_v$ to denote its argument. We can also look at the
same collision in the lab frame. This refers to the situation where one of the particles is
initially at rest. (Presumably in your lab). By the velocity addition formula, the other
particle must start with speed

$$u = \frac{2v}{1 + v^2/c^2}$$

You can also derive this result by writing down the momenta $P_1'$ and $P_2'$ in the lab
frame and equating $(P_1 + P_2)^2 = (P_1' + P_2')^2$.

In the lab frame, the angles $\phi$ and $\alpha$ at which the particles scatter are not equal.
They can be easily determined using the addition of 4-velocities that we saw in Section
7.4.2. Set u = -v in equation (7.23) and use the identity $\tan(x/2) = \sin x/(1 + \cos x)$
to get

$$\tan\phi = \frac{1}{\gamma_v}\tan(\theta/2) \quad\text{and}\quad \tan\alpha = \frac{1}{\gamma_v}\tan(\theta/2 + \pi/2)$$

One of the more interesting examples of collisions is Compton Scattering, in which the
colour of light changes after scattering off an electron (because it changes its energy
and therefore its frequency). You will derive this result in the examples sheet.

Particle Creation

Just as mass can be converted into kinetic energy, so kinetic energy can be converted
into mass through the creation of new particles. Roughly speaking, this is the way we
discover new particles of Nature.

Suppose we collide two particles, each of mass m. After the collision, we hope to
be left with these two particles, together with a third of mass M. How fast must the
original two particles collide?

Conservation of momentum gives us

$$P_1 + P_2 = P_3 + P_4 + P_5$$

where $P_1^2 = P_2^2 = P_3^2 = P_4^2 = m^2c^2$, while $P_5^2 = M^2c^2$. Let's work in the centre of mass
frame of the colliding particles, each of which has speed v. In this case, we have

$$(P_1 + P_2)^2 = 4m^2\gamma_v^2c^2 = (P_3 + P_4 + P_5)^2 \qquad (7.34)$$

Expanding out the bracket on the right-hand side gives us nine terms. To each of
these, we will apply a Cauchy-Schwarz-like inequality. For any massive particles with
4-momenta P and Q, such that $P^2 = m_1^2c^2$ and $Q^2 = m_2^2c^2$, we necessarily have
$P \cdot Q \geq m_1m_2c^2$. It is simplest to prove this by working in a frame in which one
particle is stationary. Then we have

$$P \cdot Q = (m_1c, 0)\cdot(E_2/c, \mathbf{p}_2) = m_1E_2 = m_1\sqrt{m_2^2c^4 + \mathbf{p}_2^2c^2} \geq m_1m_2c^2$$

Applied to (7.34), this gives

$$4m^2\gamma_v^2c^2 \geq 4m^2c^2 + M^2c^2 + 4Mmc^2 \quad\Rightarrow\quad \gamma_v \geq 1 + \frac{M}{2m}$$

This makes sense. The minimum amount of kinetic energy per particle is
$T = m(\gamma_v - 1)c^2 = \frac{1}{2}Mc^2$. With this minimum amount, the two colliding particles can
combine their kinetic energies to form the new particle. After the collision, all three
particles are then at rest.

What if we re-do this experiment in the lab frame, in which one of the original particles is
at rest and the other has speed u? Now we have $P_1 = (m\gamma_uc, m\gamma_u\mathbf{u})$ and $P_2 = (mc, 0)$,
so

$$(P_1 + P_2)^2 = P_1^2 + P_2^2 + 2P_1\cdot P_2 = 2m^2c^2 + 2m^2\gamma_uc^2 \geq 4m^2c^2 + M^2c^2 + 4Mmc^2$$

Rearranging gives

$$\gamma_u \geq 1 + \frac{2M}{m} + \frac{M^2}{2m^2}$$

Now we see it's not so easy to create a particle. It's certainly not enough to give the
incoming particle kinetic energy $T = \frac{1}{2}Mc^2$ as one might intuitively expect. Instead,
if you want to create very heavy particles, $M \gg m$, you need to give your initial
particle a kinetic energy of order $T \approx M^2c^2/2m$. This scales quadratically with M,
rather than the linear scaling that we saw in the centre of mass frame. The reason
for this is simple: there's no way that the end products can be at rest. The need to
conserve momentum means that much of the kinetic energy of the incoming particle
goes into producing kinetic energy of the outgoing particles. This is the reason that
most particle accelerators have two colliding beams rather than a single beam and a
stationary target.
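
To get a feel for the numbers, the sketch below compares the two thresholds, $\gamma_v \geq 1 + M/2m$ in the centre of mass frame and $\gamma_u \geq 1 + 2M/m + M^2/2m^2$ in the lab frame. It uses units with c = 1 and arbitrary mass ratios; the function names are hypothetical.

```python
def gamma_threshold_cm(m, M):
    """Minimum Lorentz factor of each beam particle in the centre of mass frame."""
    return 1.0 + M / (2.0 * m)


def gamma_threshold_lab(m, M):
    """Minimum Lorentz factor of the moving particle when the target is at rest."""
    return 1.0 + 2.0 * M / m + M**2 / (2.0 * m**2)


def kinetic_energy(m, gamma):
    """T = (gamma - 1) m c^2 in units with c = 1."""
    return (gamma - 1.0) * m


m = 1.0  # mass of the colliding particles, in arbitrary units
for M in (0.1, 1.0, 10.0, 100.0):
    T_cm_total = 2.0 * kinetic_energy(m, gamma_threshold_cm(m, M))
    T_lab = kinetic_energy(m, gamma_threshold_lab(m, M))
    print(f"M/m = {M:6.1f}: colliding beams need T = {T_cm_total:8.1f}, "
          f"fixed target needs T = {T_lab:8.1f}")
```

The two thresholds describe the same physical situation in different frames, so they must satisfy $\gamma_u = 2\gamma_v^2 - 1$, which is what the velocity addition formula $u = 2v/(1 + v^2/c^2)$ implies. For M = 100m the fixed target set-up needs about fifty times more kinetic energy than the two colliding beams combined.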



The LHC primarily collides protons in its search 
to discover new elementary particles. However, for one 
month a year, it switches to collisions of lead nuclei in 
an attempt to understand a new form of matter known 
as the quark-gluon plasma. Each lead nucleus contains
around 200 protons and neutrons. The collision results
in a dramatic demonstration of particle creation, with
the production of many thousands of particles -
protons, neutrons, mesons and baryons. Here's a very 
pretty picture. It's one of the first collisions of lead 
nuclei at LHC in 2010, shown here in all its glory by the ALICE detector. 

7.6 Spinors

In this final section, we return to understand more of the mathematical structure 
underlying spacetime and the Lorentz group. Ultimately, the new structure that we 
will uncover here has very important implications for the way the Universe works. But 
we will also see a nice application of our new tools. 

Let's start by recalling our definition of the Lorentz group. We introduced elements
of the group as 4 x 4 real matrices satisfying

$$\Lambda^T\eta\,\Lambda = \eta$$

where $\eta = \mathrm{diag}(1, -1, -1, -1)$ is the diagonal Minkowski metric. Elements with
$\det\Lambda = 1$ define the group SO(1,3). If we further restrict to elements with the upper-left
component $\Lambda^0{}_0 > 0$, which ensures that the transformation does not flip the direction
of time, then we have the sub-group $SO^+(1,3)$. As we will now see, there are some rather
beautiful subtleties associated with this group.

7.6.1 The Lorentz Group and SL(2,C)

The Lorentz group $SO^+(1,3)$ is (almost) the same as the rather different looking group
SL(2,C), the group of 2 x 2 complex matrices with determinant one. We will start by
providing the map between these two groups, and explaining what the word "almost"
means.

Before we talk about Lorentz transformations, let's first go back to think about the
points in Minkowski space themselves. So far, we've been labelling these by the
4-vector $X^\mu = (ct, x, y, z)$. But there is an alternative way of labelling these points, not by
a 4-vector but instead by a 2 x 2 Hermitian matrix. Given a 4-vector X, we can write
down such a matrix $\hat{X}$ by

$$\hat{X} = \begin{pmatrix} ct + z & x - iy \\ x + iy & ct - z \end{pmatrix}$$

which clearly satisfies $\hat{X} = \hat{X}^\dagger$. Moreover, this is the most general form of a 2 x 2
Hermitian matrix. This means that there is a one-to-one map between 4-vectors X
and 2 x 2 Hermitian matrices $\hat{X}$. We can equally well take the latter to define a point in
Minkowski space.

We learned earlier that Minkowski space comes equipped with an inner product
structure on 4-vectors. The inner product $X \cdot X$ measures the distance in spacetime
between the origin and the point X. But this is very natural in terms of the matrix
language: it is simply the determinant

$$X \cdot X = \det\hat{X} = c^2t^2 - x^2 - y^2 - z^2$$

With this new way of labelling points in Minkowski space using the matrices $\hat{X}$,
we can return to think about Lorentz transformations. Recall that, by definition, a
Lorentz transformation is a linear map which preserves the inner-product structure on
Minkowski space. Let's consider a general matrix $A \in SL(2,\mathbf{C})$. We can use this to
define a linear map

$$\hat{X} \to \hat{X}' = A\hat{X}A^\dagger \qquad (7.35)$$

By construction, if $\hat{X} = \hat{X}^\dagger$ then we also have $\hat{X}' = (\hat{X}')^\dagger$, so $\hat{X}'$ also defines a point
in Minkowski space. Moreover,

$$\det\hat{X}' = \det(A\hat{X}A^\dagger) = \det A\,\det\hat{X}\,\det A^\dagger = \det\hat{X}$$

where the last equality follows because det A = 1. This means that the map (7.35)
preserves the inner product on Minkowski space and therefore defines a Lorentz
transformation.

We may wonder if all Lorentz transformations can be implemented by suitable choices 
of A. The answer is yes. We'll exhibit the map explicitly below, but first let's just count 
the dimension of the two groups to make sure we stand a chance of it working. A general 
complex 2 x 2 matrix has 4 complex entries. The requirement that its determinant is
1 reduces this to 3 complex parameters, or 6 real parameters. This agrees with the
dimension of the Lorentz group: 6 = 3 rotations + 3 boosts.

Although the dimensions of $SO^+(1,3)$ and SL(2,C) are equal, they are not quite the
same groups. In some sense, SL(2,C) is twice as big. The reason is that the matrices
A and -A both implement the same Lorentz transformation in (7.35). We say that
SL(2,C) is the double cover of $SO^+(1,3)$ or, alternatively,

$$SO^+(1,3) \cong SL(2,\mathbf{C})/\mathbf{Z}_2$$

Mathematically, there is a 2:1 group homomorphism between SL(2,C) and $SO^+(1,3)$.
The word "homomorphism" means that the group structure is preserved under this 
map. The existence of this double cover leads to some quite extraordinary consequences. 
But, before we get to these, let's first just look at how the map works in more detail. 
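
The claims so far can be tested on a concrete example. The sketch below builds the Hermitian matrix for a sample point, applies the map (7.35) with a matrix A of unit determinant, and checks that the interval is unchanged and that A and -A give the same result. The particular A, the sample point and the helper names are arbitrary choices made for illustration.

```python
def to_matrix(ct, x, y, z):
    """Hermitian 2x2 matrix labelling the point (ct, x, y, z)."""
    return ((ct + z, x - 1j * y), (x + 1j * y, ct - z))


def to_vector(X):
    """Recover (ct, x, y, z) from a Hermitian 2x2 matrix."""
    (a, b), (c, d) = X
    return ((a + d).real / 2, (b + c).real / 2, (c - b).imag / 2, (a - d).real / 2)


def matmul(A, B):
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def dagger(A):
    """Conjugate transpose."""
    return tuple(tuple(A[j][i].conjugate() for j in range(2)) for i in range(2))


def det(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]


def transform(A, X):
    """The map (7.35): X -> A X A^dagger."""
    return matmul(matmul(A, X), dagger(A))


def interval(v):
    ct, x, y, z = v
    return ct**2 - x**2 - y**2 - z**2


# An arbitrary element of SL(2,C): det = (1+1j)*1 - 2*(0.5j) = 1.
A = ((1 + 1j, 2 + 0j), (0.5j, 1 + 0j))
minus_A = tuple(tuple(-entry for entry in row) for row in A)

v = (3.0, 1.0, -2.0, 0.5)
v_plus = to_vector(transform(A, to_matrix(*v)))
v_minus = to_vector(transform(minus_A, to_matrix(*v)))

print("det A          =", det(A))
print("interval before =", interval(v))
print("interval after  =", interval(v_plus))
print("A and -A agree  =", all(abs(p - q) < 1e-12 for p, q in zip(v_plus, v_minus)))
```

The two signs cancel between A and its conjugate transpose, which is why the map cannot tell A from -A.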

Rotations

We've seen that points in Minkowski space can be written as a 4-vector X or Hermitian
matrix $\hat{X}$. Meanwhile, Lorentz transformations act as $X \to \Lambda X$ or $\hat{X} \to A\hat{X}A^\dagger$. Here
we would like to be more explicit about which matrices A correspond to the different
Lorentz transformations.

We start with rotations. By definition, these are the transformations which leave
time untouched. From (7.35), this means that we want matrices A which map $\hat{X} = ct\,\mathbf{1}$
(where 1 here is the unit 2 x 2 matrix) to itself. In other words, rotations should obey

$$AA^\dagger = \mathbf{1}$$

But such matrices are the familiar unitary matrices. We learn that rotations sit in the
subgroup $A \in SU(2) \subset SL(2,\mathbf{C})$. You may be used to thinking of the rotation group
as SO(3) rather than SU(2). But these are almost the same thing: SU(2) is the double
cover of SO(3),

$$SO(3) \cong SU(2)/\mathbf{Z}_2$$

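Let's see how this equivalence between matrices $R \in SO(3)$ and $A \in SU(2)$
works. For rotations around the x-axis, we have

$$R = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{pmatrix} \quad\longleftrightarrow\quad A = \pm\begin{pmatrix} \cos(\theta/2) & i\sin(\theta/2) \\ i\sin(\theta/2) & \cos(\theta/2) \end{pmatrix}$$

To see this, you just need to substitute the matrix A into the map (7.35) and check
that it reproduces the same rotation as the matrix R. Note the ± possibility on A
which reflects the fact that SL(2,C) is the double cover of the Lorentz group. This is
also related to the fact that the angle in A is $\theta/2$ rather than $\theta$: we will return to this
shortly. For rotations about the y-axis, we have

$$R = \begin{pmatrix} \cos\theta & 0 & -\sin\theta \\ 0 & 1 & 0 \\ \sin\theta & 0 & \cos\theta \end{pmatrix} \quad\longleftrightarrow\quad A = \pm\begin{pmatrix} \cos(\theta/2) & \sin(\theta/2) \\ -\sin(\theta/2) & \cos(\theta/2) \end{pmatrix}$$

Finally, for rotations about the z-axis, we have

$$R = \begin{pmatrix} \cos\theta & \sin\theta & 0 \\ -\sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} \quad\longleftrightarrow\quad A = \pm\begin{pmatrix} e^{i\theta/2} & 0 \\ 0 & e^{-i\theta/2} \end{pmatrix}$$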

There's a somewhat nicer way of writing these matrices which makes their structure
clearer. To see this, we first need to introduce the Pauli matrices,

$$\sigma^1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma^2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma^3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

Together with the unit matrix, these form a basis of 2 x 2 Hermitian matrices. They
have the nice property that $\sigma^i\sigma^j = \delta^{ij} + i\epsilon^{ijk}\sigma^k$. In general, a rotation by angle $\theta$
around an axis with unit vector n is associated to the unitary matrix

$$A = \pm\exp\left(\frac{i\theta}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) \qquad (7.37)$$

Of course, the discussion above also tells us how the rotations fit within the Lorentz
group. The matrix A remains unchanged, while the Lorentz transformation $\Lambda$ is
constructed by embedding the orthogonal matrix R in the lower-right block as shown in
(7.15).

Boosts 

The Pauli matrices also provide a simple way to describe the $A \in SL(2,\mathbf{C})$
corresponding to Lorentz boosts. A boost with rapidity $\varphi$ in the direction n is associated
to

$$A = \pm\exp\left(-\frac{\varphi}{2}\,\mathbf{n}\cdot\boldsymbol{\sigma}\right) \qquad (7.38)$$

Unlike rotations, these matrices are not unitary. This ensures that they affect the time
component. Again, you can check that this reproduces the Lorentz boosts of the form
(7.17) simply by substituting this expression for A into the map (7.35). For example,
a boost in the z-direction is given by the matrix

$$A = \pm\begin{pmatrix} e^{-\varphi/2} & 0 \\ 0 & e^{+\varphi/2} \end{pmatrix}$$

This tells us that x and y are left unchanged, while $ct' + z' = e^{-\varphi}(ct + z)$ and
$ct' - z' = e^{+\varphi}(ct - z)$. Doing the algebra gives

$$ct' = \cosh\varphi\,ct - \sinh\varphi\,z, \qquad z' = \cosh\varphi\,z - \sinh\varphi\,ct$$

which indeed agrees with the usual form of the Lorentz transformation (7.17) written
in terms of the rapidity.
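
Both checks can also be done numerically. The sketch below applies the map (7.35) with the diagonal matrices for a rotation about the z-axis and a boost along the z-axis, and compares the output with the rotation matrix R and with the cosh/sinh form of the boost. The sample point, angle and rapidity are arbitrary, and the helper names are illustrative only.

```python
import cmath
import math


def to_matrix(ct, x, y, z):
    """Hermitian 2x2 matrix labelling the point (ct, x, y, z)."""
    return ((ct + z, x - 1j * y), (x + 1j * y, ct - z))


def to_vector(X):
    (a, b), (c, d) = X
    return ((a + d).real / 2, (b + c).real / 2, (c - b).imag / 2, (a - d).real / 2)


def transform(A, X):
    """The map (7.35), X -> A X A^dagger, written out for 2x2 matrices."""
    AX = tuple(
        tuple(sum(A[i][k] * X[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )
    return tuple(
        tuple(sum(AX[i][k] * A[j][k].conjugate() for k in range(2)) for j in range(2))
        for i in range(2)
    )


def rotation_z(theta):
    """A = diag(e^{i theta/2}, e^{-i theta/2})."""
    return ((cmath.exp(0.5j * theta), 0j), (0j, cmath.exp(-0.5j * theta)))


def boost_z(phi):
    """A = diag(e^{-phi/2}, e^{+phi/2})."""
    return ((complex(math.exp(-phi / 2)), 0j), (0j, complex(math.exp(phi / 2))))


ct, x, y, z = 2.0, 0.3, -0.7, 1.0
theta, phi = 0.4, 0.5

rotated = to_vector(transform(rotation_z(theta), to_matrix(ct, x, y, z)))
boosted = to_vector(transform(boost_z(phi), to_matrix(ct, x, y, z)))

print("rotated:", rotated)
print("from R :", (ct, x * math.cos(theta) + y * math.sin(theta),
                   -x * math.sin(theta) + y * math.cos(theta), z))
print("boosted:", boosted)
print("cosh/sinh:", (math.cosh(phi) * ct - math.sinh(phi) * z, x, y,
                     math.cosh(phi) * z - math.sinh(phi) * ct))
```

Replacing theta by theta + 2π flips the sign of A but leaves the rotated point unchanged, which is the double cover making itself felt.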

Figure 60: The celestial sphere for one observer...
Figure 61: ...and for another

7.6.2 What the Observer Actually Observes 

There's a rather nice application of the above formalism. In Section 7.2, when we 
first encountered relativistic phenomena such as length contraction, we stressed that 
different observers ascribe different coordinates to spacetime events. But this is not the 
same thing as what the observer actually sees, for this also involves the time that the 
light took to travel from the event to the observer. So this leaves open the question: 
what does an observer observe? What do Lorentz contracted objects really look like? 
As we will now show, writing the Lorentz group as SL{2, C) gives a wonderfully elegant 
way to answer this question. Moreover, what we will find is somewhat surprising. 

What an observer actually sees are, of course, light rays. As objects move through 
Minkowski space, they emit light which then propagates to the position of the observer. 
We've drawn this in the diagrams, both of which have the observer placed at the origin 
of Minkowski space. We've also drawn the future and past lightcones emitted from the 
origin. 

In the left-hand figure, the observer is assumed to be stationary with time coordinate 
t. At each fixed moment in time, t, the light rays form a sphere $S^2$. This is drawn as
the red circle in the past lightcone of the diagram. If we assume that no other object 
comes between this sphere and the observer, then the light rays intersecting the sphere 
are a good representation of what the observer actually sees. If he takes a snapshot of 
everything around him with some really super-dooper fancy camera, he would record 
the image on this sphere. This is sometimes given the name of the celestial sphere, 
reflecting the fact that this is how we should think of viewing the night sky (at least if 
the Earth wasn't obscuring half of it). 

Let's now look at what an observer in a different inertial frame sees. This is shown 
in the right-hand figure. This second observer will also take a snapshot using his fancy 
camera as he passes through the origin. But this new observer's celestial sphere is given 
by null rays that sit at $t' =$ constant. Although it's no longer obvious from the picture, 
we know that the space defined by the intersection of light rays with the constant t' 
hyperplane must still be a sphere simply because all inertial observers are equivalent. 
However, this new celestial sphere is clearly tilted with respect to the previous one. 

The four light rays drawn in the figure intersect both celestial spheres. These light 
rays therefore provide a map between what the two observers see. This is a map 
between the two celestial spheres, $S^2 \to S^2$. Our goal is to construct this map. 

This is where our new mathematical formalism comes in. Any point on a light ray is, 
by definition, at vanishing distance from the origin when measured in the Minkowski 
metric. Equivalently, the 2x2 Hermitian matrix X describing this point must have 
vanishing determinant. But there's a nice way to write down such matrices with zero 
determinant. We introduce a two-component complex vector, $\xi_\alpha$, with $\alpha = 1, 2$. Then 
we write 

$$X = \xi\xi^\dagger = \begin{pmatrix} |\xi_1|^2 & \xi_1\xi_2^* \\ \xi_2\xi_1^* & |\xi_2|^2 \end{pmatrix}$$

which, by construction, obeys det X = 0. It's simple to check that the most general 
Hermitian matrix X with det X = 0 can be written in this way. Note, however, that 
there's a redundancy in this description, since if we rotate both components of $\xi$ by a 
phase, so that $\xi \to e^{i\beta}\xi$, then X remains unchanged. 
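
Both statements are easy to test on a computer. The sketch below builds the matrix from an arbitrarily chosen two-component vector, then checks that its determinant vanishes and that an overall phase drops out. The helper names and the sample numbers are just illustrative choices.

```python
import cmath

def null_matrix(xi):
    """Build X = xi xi^dagger from a two-component complex vector xi = (xi1, xi2)."""
    xi1, xi2 = xi
    return [[xi1 * xi1.conjugate(), xi1 * xi2.conjugate()],
            [xi2 * xi1.conjugate(), xi2 * xi2.conjugate()]]

def det(X):
    """Determinant of a 2x2 matrix."""
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]

xi = (complex(0.3, -1.2), complex(0.8, 0.5))
X = null_matrix(xi)

# Rotate both components by the same phase and rebuild the matrix.
phase = cmath.exp(0.9j)
X_rotated = null_matrix((phase * xi[0], phase * xi[1]))

print(abs(det(X)))  # zero up to rounding
print(max(abs(X[i][j] - X_rotated[i][j]) for i in range(2) for j in range(2)))
```

Both printed numbers should be zero to within floating-point error.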

An Aside: The Hopf Map 

In our new notation, the celestial sphere at constant time t is simply given by 

$$\xi^\dagger\xi = |\xi_1|^2 + |\xi_2|^2 = \text{constant} \qquad (7.39)$$

There's actually some interesting maths in this statement. It's obvious that given 
two complex variables $\xi_1$ and $\xi_2$, the equation (7.39) defines a 3-dimensional sphere $S^3$. 
What's perhaps less obvious, but nonetheless true, is that if we identify all points on 
$S^3$ related by $\xi \to e^{i\beta}\xi$, then we get a 2-dimensional sphere $S^2$. In mathematical language, 
we say that $S^3/U(1) \cong S^2$. 

It's simple to write directly the map $S^3 \to S^2$. Given a complex 2-vector, $\xi$, obeying 
$\xi^\dagger\xi = 1$, you can define 3 real numbers $k^i$ by 

$$k^i = \xi^\dagger\sigma^i\xi$$

where $\sigma^i$ are the three Pauli matrices (7.36). Then a little algebra shows that $k^i k^i = 1$. 
In other words, $k^i$ gives a point on $S^2$. This map from $S^3$ to $S^2$ is called the Hopf 
map. 
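
If you don't fancy doing the "little algebra", here is a rough numerical sketch of the Hopf map. It assumes the standard form of the Pauli matrices, for which the three components work out to be twice the real part of $\xi_1^*\xi_2$, twice its imaginary part, and $|\xi_1|^2 - |\xi_2|^2$. The function names are invented for this example.

```python
import cmath
import math

def normalise(xi):
    """Rescale xi = (xi1, xi2) so that xi^dagger xi = 1."""
    xi1, xi2 = xi
    norm = math.sqrt(abs(xi1) ** 2 + abs(xi2) ** 2)
    return (xi1 / norm, xi2 / norm)

def hopf(xi):
    """k^i = xi^dagger sigma^i xi, written out for the standard Pauli matrices."""
    xi1, xi2 = xi
    cross = xi1.conjugate() * xi2   # k^1 + i k^2 = 2 * conj(xi1) * xi2
    return (2 * cross.real, 2 * cross.imag, abs(xi1) ** 2 - abs(xi2) ** 2)

xi = normalise((complex(0.3, -1.2), complex(0.8, 0.5)))
k = hopf(xi)

# Points of S^3 related by an overall phase land on the same point of S^2.
phase = cmath.exp(1.7j)
k_phase = hopf((phase * xi[0], phase * xi[1]))

print(sum(c * c for c in k))  # 1 up to rounding
print(k, k_phase)             # the same point of S^2
```

The first number confirms that $k^i$ sits on the unit sphere. The second line shows that the phase of $\xi$ is invisible to $k^i$, which is the identification $S^3/U(1)$ at work.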

Back to the Real World 

Let's now use these new objects $\xi$ to construct the map between the two celestial 
spheres. A nice fact is that Lorentz transformations act in a natural way on the 
two-component $\xi$. To see this, recall that 

$$X = \xi\xi^\dagger \to A X A^\dagger = (A\xi)(A\xi)^\dagger$$

But we can view this as a transformation of $\xi$ itself. We have simply the $SL(2, \mathbb{C})$ 
transformation 

$$\xi \to A\xi \qquad (7.40)$$

However, this is not quite our mapping. We can start with a celestial sphere defined 
by (7.39) and act with a Lorentz transformation. The trouble is that the resulting 
space we get remains the first celestial sphere, just written in the second observer's 
coordinates. We still need to propagate the light rays forward and backwards so that 
they intersect the second celestial sphere. 

To avoid this complication, it's best to think about these celestial spheres in a slightly 
different way. Rather than saying that they are defined at constant time, let's instead 
define them as equivalence classes of light rays. This means that we lose the information 
about where we are along the light ray: we only keep the information about which light 
ray we're talking about. Mathematically, this is very simple: to each $\xi$ we associate a 
single complex number $\omega \in \mathbb{C}$ by 

$$\omega = \frac{\xi_1}{\xi_2}$$

The map from the celestial sphere $S^2 \to \mathbb{C}$ is known as stereographic projection and 
is shown in the figure. Strictly speaking, $\omega$ parameterises $\mathbb{C} \cup \{\infty\}$, with the point 
at infinity included to accommodate the point $\xi_2 = 0$, which is the North pole of the 
celestial sphere. This extended complex plane is called the Riemann sphere. 

Figure 62: The stereographic projection. The southern hemisphere is mapped to inside the 
dotted circle; the northern hemisphere is mapped to outside this circle. 
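
To see the projection in action, here is a small sketch. It assumes that the point on the sphere attached to $\xi$ is given by the Hopf map with the standard Pauli matrices. Under that assumption the inverse projection has the explicit form coded in `omega_to_sphere` below. The formula and the function names belong to this sketch rather than to the text above.

```python
import math

def spinor_to_omega(xi):
    """omega = xi1 / xi2; returns None for the point at infinity (xi2 = 0)."""
    xi1, xi2 = xi
    return None if xi2 == 0 else xi1 / xi2

def spinor_to_sphere(xi):
    """Point on the unit sphere from xi^dagger sigma^i xi, divided by xi^dagger xi."""
    xi1, xi2 = xi
    norm = abs(xi1) ** 2 + abs(xi2) ** 2
    cross = xi1.conjugate() * xi2
    return (2 * cross.real / norm, 2 * cross.imag / norm,
            (abs(xi1) ** 2 - abs(xi2) ** 2) / norm)

def omega_to_sphere(w):
    """Inverse stereographic projection in the convention assumed above."""
    r2 = abs(w) ** 2
    return (2 * w.real / (1 + r2), -2 * w.imag / (1 + r2), (r2 - 1) / (r2 + 1))

xi = (complex(0.2, 0.4), complex(-0.7, 0.9))
w = spinor_to_omega(xi)

print(spinor_to_sphere(xi))
print(omega_to_sphere(w))   # the same point, reached through omega
print(abs(w) < 1, omega_to_sphere(w)[2] < 0)
```

The first two lines should agree. The last line illustrates the figure caption: a point with $|\omega| < 1$ sits in the southern hemisphere, while $\xi_2 = 0$ (no finite $\omega$) is the North pole.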



Now the light rays seen by the first observer are labelled by $\omega \in \mathbb{C}$ and form a 
celestial sphere. The light rays seen by the second observer are labelled by $\omega' = \xi'_1/\xi'_2$ 
and form a different celestial sphere. A Lorentz transformation $A \in SL(2, \mathbb{C})$ acts on 
$\xi$ as (7.40) which, in terms of $\omega$, reads 

$$\omega \to \omega' = \frac{a\omega + b}{c\omega + d} \quad \text{where} \quad A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \quad \text{and} \quad ad - bc = 1 \qquad (7.41)$$

This transformation on the complex plane is known as a Möbius transformation. It's 
simple to see that Möbius transformations form a group. In fact, from what we've seen 
above, you shouldn't be surprised to learn that the group of Möbius transformations is 
$SL(2, \mathbb{C})$, up to a discrete $\mathbb{Z}_2$ identification. 

Suppose now that the first observer sees an object on his celestial sphere that traces 
out some shape. After stereographic projection, that will result in a shape on the 
complex plane (perhaps passing through the point at infinity). This appears to the 
second observer to be transformed by (7.41). Upon taking the inverse stereographic 
projection, we will learn what shape the second observer really sees. 

To make progress, we should look at a simple example. And the simplest example is 
for an object which is itself a sphere. This means that, when stationary with respect 
to the first observer, the outline of the object looks like a circle. What does the second 
observer see? To answer this, I'll need to invoke some simple facts about stereographic 
projection and Möbius transformations. Although I won't prove them, they are among 
the most basic properties of these transformations and will be proven in next year's 
Geometry course. The facts are: 

• The stereographic projection maps circles on the sphere to circles or lines on the 
plane. 

• Möbius transformations map circles and lines on the plane to circles or lines on 
the plane. 

Hiding behind these facts is the statement that both maps are conformal, meaning that 
they preserve angles. But, for us, the upshot is that a circle on the first celestial sphere 
is mapped under a Lorentz transformation to a circle on the second. 
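
Although we won't prove the two facts, we can at least test the conclusion numerically. The sketch below takes a circle on the first celestial sphere, projects it to the plane, applies the Möbius transformation for a boost along x, and projects back. It then checks that the image points all lie in a single plane, which for points on a sphere is the same thing as lying on a circle. The form of the boost matrix and the projection convention are assumptions of the sketch, as are all the function names.

```python
import math

def to_plane(k):
    """Stereographic projection (assumed convention); the North pole k3 = 1 goes to infinity."""
    k1, k2, k3 = k
    return complex(k1, -k2) / (1 - k3)

def to_sphere(w):
    """Inverse of to_plane."""
    r2 = abs(w) ** 2
    return (2 * w.real / (1 + r2), -2 * w.imag / (1 + r2), (r2 - 1) / (r2 + 1))

def mobius(A, w):
    """omega -> (a omega + b) / (c omega + d) for A = [[a, b], [c, d]]."""
    (a, b), (c, d) = A
    return (a * w + b) / (c * w + d)

def circle_on_sphere(tilt, opening, n):
    """n points at angle `opening` from an axis tilted by `tilt` away from the z-axis."""
    points = []
    for j in range(n):
        az = 2 * math.pi * j / n
        x = math.sin(opening) * math.cos(az)
        y = math.sin(opening) * math.sin(az)
        z = math.cos(opening)
        # Rotate about the y-axis to tilt the circle.
        points.append((x * math.cos(tilt) + z * math.sin(tilt), y,
                       -x * math.sin(tilt) + z * math.cos(tilt)))
    return points

def sub(p, q):
    return tuple(a - b for a, b in zip(p, q))

def cross(p, q):
    return (p[1] * q[2] - p[2] * q[1], p[2] * q[0] - p[0] * q[2],
            p[0] * q[1] - p[1] * q[0])

def dot(p, q):
    return sum(a * b for a, b in zip(p, q))

phi = 0.8  # rapidity of an assumed boost along x
A = [[math.cosh(phi / 2), -math.sinh(phi / 2)],
     [-math.sinh(phi / 2), math.cosh(phi / 2)]]
minus_A = [[-entry for entry in row] for row in A]

circle = circle_on_sphere(tilt=2.0, opening=0.4, n=12)
image = [to_sphere(mobius(A, to_plane(k))) for k in circle]

# Normal to the plane through three image points; the rest should lie in it.
normal = cross(sub(image[4], image[0]), sub(image[8], image[0]))
max_dev = max(abs(dot(normal, sub(p, image[0]))) for p in image)
print(max_dev)  # zero up to rounding: the image is again a circle

# A and -A give the same Möbius map, which is the Z2 identification.
w0 = to_plane(circle[0])
print(abs(mobius(A, w0) - mobius(minus_A, w0)))
```

You could repeat the experiment with any other matrix of unit determinant and the image points should still be coplanar.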

Let's pause to take this in. The first observer saw an object which had the shape 
of a circle. Based on the arguments of Lorentz contraction, you might expect that the 
second observer sees a squashed circle, maybe an ellipse. Yet this is not what happens. 
Instead, the second observer also sees a circle! The effects of the time of flight of 
light completely eliminate the Lorentz contraction. This fact was only realised more 
than 50 years after Einstein's formulation of special relativity when it was discovered 
independently by Terrell and Penrose. It is sometimes said to be the "invisibility of the 
Lorentz contraction". Note that it doesn't mean that the effects of Lorentz contraction 
that we discussed before are not real. It just means that you don't get to see them if 
you take a picture. 

7.6.3 Spinors 

Finally, we're in a position to explain what the title of this section means. A spinor 
is simply a two-dimensional complex vector $\xi$ which, under a Lorentz transformation 
$A \in SL(2, \mathbb{C})$, changes as $\xi \to A\xi$. 

(Some confusing caveats: $\xi$ defined in this way is known as a Weyl spinor. In fact, 
strictly speaking, it is known as a left-handed Weyl spinor. For reasons that I won't go 
into here, we can also define something called a right-handed Weyl spinor by exchanging 
$\varphi \to -\varphi$ in the definition of the boosts (7.38). Then combining a left-handed Weyl spinor 
together with a right-handed Weyl spinor gives a four-component complex object that 
is called a Dirac spinor. See, I told you it would be confusing!) 

We've already seen how spinors can be used to describe light rays. But this is not 
their only use; they very much have a life of their own. Before I describe this, let 
me firstly explain a property that makes it very surprising that spinors have any real 
relevance in the world. This harks back to the observation that $SL(2, \mathbb{C})$ is the double 
cover of the Lorentz group. Suppose that there is some object in the Universe that is 
actually described by a spinor. This means, in particular, that the state of the object 
with $\xi$ is different from the state of the object with $-\xi$. What happens when we rotate 
this object? Well, we've already seen how to enact a rotation using $SL(2, \mathbb{C})$ matrices: 
they are given by (7.37). Except if we're acting on spinors we need to make a decision: 
do we pick $+A$ or do we pick $-A$? Because, unlike the action on Minkowski space, 
these two different matrices will result in different states $\xi$ after a rotation. It doesn't 
actually matter which choice we pick, as long as we make one. So let's decide that a 
rotation about an axis acts on a spinor by 



This all seems fine. The surprise comes when we look at what happens if we rotate the 
spinor by $2\pi$. It doesn't come back to itself. Instead, after a rotation by $2\pi$ we find 
$\xi \to -\xi$. We have to rotate by $4\pi$ to get the spinor to return to itself! 
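
You can watch this sign flip happen numerically. The sketch below assumes that a rotation by angle $\theta$ about a unit axis n acts on the spinor through the matrix $\exp(i\theta\, n\cdot\sigma/2) = \cos(\theta/2) + i\sin(\theta/2)\, n\cdot\sigma$, with the standard Pauli matrices. The overall sign of the exponent is a convention that I'm guessing here, but the behaviour at $2\pi$ and $4\pi$ doesn't depend on it. The function names are hypothetical.

```python
import math

def rotation(theta, n):
    """Assumed form: cos(theta/2) 1 + i sin(theta/2) n.sigma for a unit axis n."""
    nx, ny, nz = n
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[complex(c, s * nz), complex(s * ny, s * nx)],
            [complex(-s * ny, s * nx), complex(c, -s * nz)]]

def apply(A, xi):
    """Act with a 2x2 matrix on a two-component spinor."""
    return (A[0][0] * xi[0] + A[0][1] * xi[1],
            A[1][0] * xi[0] + A[1][1] * xi[1])

xi = (complex(0.3, -1.2), complex(0.8, 0.5))
axis = (0.6, 0.0, 0.8)  # any unit vector will do

once = apply(rotation(2 * math.pi, axis), xi)
twice = apply(rotation(4 * math.pi, axis), xi)

print(once)   # minus xi, up to rounding
print(twice)  # xi again
```

The half-angle in the matrix is the whole story: a full turn of $2\pi$ only gets the spinor halfway round.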

Wouldn't it be astonishing if there were objects in the Universe which had this 
property: that you could rotate them and find that they didn't come back to themselves? 
This is even more astonishing when you realise that rotating an object is the same thing 
as walking around it. If such objects existed, you would be able to circle them once 
and see that the object sits in a different state just because you walked around it. How 
weird would that be? 

Well, such objects exist. What's more, they're the same objects that you and I 
are made of: electrons and protons and neutrons. All of these particles carry a little 
angular momentum whose direction is described by a spinor rather than a vector. This 
means that Nature makes use of all the pretty mathematics that we've introduced in 
this section. The symmetry group of the Universe we live in is not the Lorentz group 
$SO^+(1, 3)$. Instead, it is the double cover $SL(2, \mathbb{C})$. And the basic building blocks of 
matter have subtle and wonderful properties. Turn an electron 360° and it isn't the 
same; turn it 720° and you're back to where you started. If you want to learn more 
about this, you can find deeper explanations in the lecture notes on Quantum Field 
Theory. 